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Methods and Compositions for Treating Abnormal Cell Growth Related to 
Unwanted Guanine Nucleotide Exchange Factor Activity 



5 



10 

FIELD OF THE INVENTION 

This invention is in the field of nriolecular biology, and involves methods and 
compositions for regulating unwanted cell growth through the regulation of the activity 
of certain guanine nucleotide exchange factors. 

15 BACKGROUND OF THE INVENTION 

Ras is a member of a superfamily of GTPases that regulate diverse signaling 
pathways. Ras itself has been shown to be involved in regulating cell growth and 
differentiation (See, Buguski, M. S. and McCormick, F. (1993) Nature 366. 

643-654). A subfamily of Ras consists of Rho. Rac, and Cdc42. These GTPase have 

20 also been shown to be involved in regulating cell growth, particularly as relating to 

cellular transformation, as well as controlling the formation of focal contacts and 
alterations in the actin cytoskeleton which occur upon growth factor stimulation (See, 
Coso. O. A.. Chiariello. M., Yu. J.-C, Teramoio, H., Crespo, P.. Xu, N., Miki, T. and. 
GuLkind. J. S. (1995) C^// 81. 1137-1146: Hill, C. S,. Wynne. J. and Treisman. R. 

25 (1995)^// 81,1159-1170; Kozma. R.. Ahmed, S., Best, A. and Lim, L. (1995) Mo/, 

CelL BioL 15, 1942-1952: Minden, A.. Lin. A., Claret, F.-X., Abo, A. and Karin. M. 
(1995) C^// 81,1147-1157: Nobes, C D. and Hall. A. (1995) Q// 81,53-62: Olson, 
M, F.. Ashworth, A. and Hall. A. (1995) Science 269. 1270-1272). Common to all 
Ras family members is their ability to cycle between inactive (GDP bound) and active 

30 (GTP bound) states. In this regard, these GTPases act as molecular switches, capable 

of processing information and then disseminating that information to control a specific 
pathway. 

This property of cycling between GTP and GDP states has provided a means to 
identify and purify proteins which regulate the nucleotide state of Ras and Ras related 
35 GTPases. See. Boguski, M. S, and McCormick, F. (1993) Nature 366, 643-654. 
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By monitoring the hydrolysis of GTP to GDP. GTPase activating proteins 
(GAPS) have been characterized for many members of the Ras family. See. Boguski. 
M. S. and McCormick. F. (1993) Nature 366. 643-654: Barfod. E. T.. Zheng. Y.. 
Kuang. W.-J.. Hart. M. J.. Evans. T.. Cerione. R. A. and Ashkenaz. A. (1993) J. Biol. 
Chem. 268. 26059-26062: Lamarche, N. and Hall. A. (1994) Trends Genet. 10. 436- 
440: Cerione. R. A. and Zheng, Y. (1996) Cin reni Opinion in Cell Biology 8. 216- 
222. The latter reference provides a good discussion of the properties of those proteins 
that affect the guanine nucleotide state of Ras and Ras related proteins. Guanine 
nucleotide dissociation inhibitors (GDIs) were identified based on their ability to 
inhibit the dissociation of GDP. It has subsequently been determined that they also 
bind to the GTP state, inhibiting the intrinsic and GAP stimulated GTP hydrolysis. 
See. Boguski. M. S. and McCormick. F. (1993) Nature 366. 643-654. In general. 
GAPS and effectors have a high affinity for the GTP-bound state, while GDI proteins 
bind most tightly to the GDP-bound state. These properties have been exploited to 
15 purify effectors for Cdc42Hs ( See, Bagrodia. S.. Taylor. S. J.. Creasy. C. L.. 

Chemoff. J. and Cerione. R. A. (1995) J. Biol. Chem. 270. 22731-22737: Manser. E.. 
Leung. T.. Salihuddin. H.. Zhao. Z.-s. and Lim. L. {\99Ar)Nature 367. 40-46: Martin. 
G. A.. BoUag. G.. McCormick. F. and Abo. A. (1995) EMBO J. 14. 1970-1978). Ras 
(See. Moodie. S. A.. Willumsen. B. M.. Weber. M. J. and Wolfman. A. (1993) Science 
20 260. 1658-1661: Rodriguez- Viciana. P.. Warne. P. H.. Dhand. R.. Vanhaesebroeck. 

B.. Gout. I.. Fry, M. J.. Waterfield. M. D. and Downward. J. (1994) Nature 
370. 527-532) and Rho (See. Leung. T.. Manser. E.. Tan. L. and Lim. L. (1995) J. 
Biol. Chem. 270. 29051-29054: Watanabe. G.. Saito. Y.. Madaule. P.. Ishizaki. T.. 
Fujisawa. K.. Morii. N.. Mukai. H.. Ono. Y.. Kakizuki. A. and Narumiya. S. (1996) 
25 Science 271.645-648). An affinity approach has also been employed with Cdc42Hs- 

GTP and has led to the characterization of IQGAPl. a potential mediator for observed 
cyioskeletal events induced by Cdc42. See. Hart. M. J.. Callow. M.. Souza. B. and 
Polakis. P. (1996) EMBO J. 15. 2997-3005. 

A modification of this affinity approach can also be used to identify and purify 
guanine nucleotide exchange factors (GEFs). GEFs can be distinguished from other 
regulatory proteins by their ability to interact preferentially with the nucleotide- 
depleted state of G-proteins. See. Hart. M. J.. Eva. A.. Zangrilli. D.. Aaronson. S. A.. 
Evans. T., Cerione. R. A. and Zheng, Y. (1994) J. Biol. Chem. 269. 62-65: Mostelier. 
. R. D.. Han. J. and Broek. D. (1994) Mol. Cell. Biol. 14. 1104-1 1 12. By stimulating 
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the dissociation of GDP and subsequent binding of GTP, GEFs play an important role 
in the activation of Ras-like proteins. For exaniple, Ras is converted to its GTP-bound 
form by the growth-factor stimulated translocation of Sos. a Ras-specific GEF. See, 
Buday. L. and Downward. J. (1993)Ce// 73, 61 1-620, 
5 The characterization of GEFs that specifically activate Rac family members 

will help elucidate signalling pathways in which these GTPases participate, and thus 
lead to a better understanding of the molecular basis of cell growth. This, in turn, will 
enable the identification of drugs for preventing or treating diseases where 
uncontrolled cell growth is the cause. Because Rac plays a key role in signal 

1 0 transduction and cell growth, the identification and properties of Rac GEFs is presently 

receiving considerable scientific attention. One such Rac GEF is known. Tiam- 1 . 
See, Michiels, F., Habets, G.G., Stant, J.C., van der Kammen, R.A., and Coiiard, 
J.G. (1995) Nature 375, 338-340. See also. Eva. A. and Aaronson. S. A. (1985) 
Nature 316. 273-275: Toksoz. D. and Williams, D. A. (1994) Oncogene 9, 621-628. 

1 5 DESCRIPTION OF THE INVENTION 

The present invention relates to all aspects of a guanine exchange factor (GEF), 
in particular, a Rac-GEF. A GEF modulates cell signaling pathways, both in vino and 
in vivo, by modulating the activity of a GTPase. By way of illustration, a Rac-GEF. 
which modulates the activity of a Rac GTPase, is described. However, the present 

20 invention relates to other GEFs, especially other Rac-GEFs. 

The present invention preferably relates to an isolated Rac-GEF polypeptide 
characterized by having a Src homology, Dbl homology and pleckstrin homology 
domains, and variants thereof, or fragments of such polypeptides, nucleic acids coding 
for such Rac-GEFs or nucleic acid fragments, and derivatives of the polypeptides and 

25 nucleic acids. 

The invention also relates to methods of using such polypeptides, nucleic acids, 
or derivatives thereof, e.g., in therapeutics, diagnostics, and as research tools. 

Another aspect of the present invention involves antibodies and other ligands 
which recognize the invention Rac-GEF, regulators of Rac-GEF activity and other 

30 GEFs, and methods of treating pathological conditions associated or related to such 

Rac GTPase. 

The invention also relates to methods of testing for and/or identifying agents 
which regulate GEF by measuring their effect on GEF activity, e.g., in binding lo a 
GTPase and/or nucleotide exchange activity. 

3 
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The inveniion also relates lo methods of assaying for GEF activity, preferrably 
using activators of GEF activity. 

These and other aspects of the invention will become apparent upon a full 
considertion of the following disclosure. 
5 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows the complete nucleotide sequence (SEQ ID N0:1) and deduced 
amino acid sequence (SEQ ID N0:2) for a polypeptide encoded for by a human GEF- 
Rac gene. 

FIG. 2 shows the brain specific nucleotide sequence for a Rac-GEF. 
10 FIG. 3 shows the domain structures of full length Tiam-I. and truncations of 

the molecule. 

FIG. 4 shows the stimulatory effect of ascorbyl stearate on Rac exchange 
activity by various forms of truncated Tiam-1. the 85kd and 135 kD molecules. 

FIG. 5 shows the effects of certain ascorbyl compounds, inositol lipids and 
15 phospholipids on Tiam-1 stimulated Rac-GEF activity. 

FIG. 6 shows the effects of ascorbyl sterate on Tiam- 1 constructs that have PH 
and DH domains. 

DETAILED DESCRIPTION OF THE INVENTION 
Ir\ accordance w^ith the present invention, a novel polypeptide and 
20 nucleic acid coding for a Rac-GEF has been identified and isolated. Alternate 

variants of the molecule have also beeen identified. As used herein, Rac-GEF 
means a polypeptide, or a nucleic acid coding for a Rac-GEF polypeptide, 
which polypeptide has a specific binding affinity for a guanine nucleotide- 
depleted state of G-proteins (in particular Rac), a guanine nucleotide exchange 
25 activity, an oncogenic transforming activity, and an immunogenic activity. By 

specific binding affinity, it is meant that the polypeptide has a binding 
preference for the nucleotide-depleted state of the G-protein, in contrast, e.g., 
to the GDP- or GTP-bound state of the G-protein v^hich is preferentially bound 
by other regulatory proteins. By guanine nucleotide exchange activity, it is 
30 meant that the polypeptide stimulates or catalyzes the dissociation of GDP 

from a G-protein, such as Rac, and subsequent binding of GTP. By cellular 
oncogenic transforming activity, it is meant that introduction of a nucleic acid 
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coding for Rac-GEF into a cell line, e.g., NIH 3T3 cells, confers a transformed 
phenotype on such cells. A transformed phenotype can be measured by foci 
formation, e.g., as characterized and described by Eva and Aaronson, Nature, 
316:273-276, 1985. Immunogenic activity means that the polypeptide binds to 
5 Rac-GEF specific antibodies or is capable of eliciting an immune response 

specific for a Rac-GEF. Immunogenic activities are discussed below. The 
above-mentioned activities of a Rac-GEF polypeptide can be assayed, e.g., as 
described below in the examples or according to methods which the skilled 
worker would know. A Rac-GEF polypeptide, or corresponding nucleic acid 

10 coding for it, means a polypeptide which can be isolated from a natural 

source. It therefore includes naturally-occurring normal and mutant alleles. 
Natural sources include, e.g., living cells obtained from tissues and whole • 
organisms, and cultured cell lines. 

To identify a human gene that encodes a Rac-GEF, we performed a 

15 search of the EST da.a base for Dlb homologs. The search v^as performed 

using an amino acid sequence (residues 1-519) encoded by the human TIM 
protein (Chan et al., 1994, Ocogene, Vol. 9, pages 1057-1063). A single clone 
was identified, and the plasmid encoding this insert was purchased via the 
I.M.A.G.E. Consortium (Research Genetics). Using this cDNA as template, a 

20 511-bp 'T-labelled PCR product was produced using oligos 

5^-GGAGGCCATGTTCGAGCTGG-3' and 

5'- GCTGATCATCTGTTCCGTGC-3' (5' and 3' primers, respectively) and 
labelled nucleotides. This labeled PCR fragment was used as a probe to screen 
approximately 4 x 10* clones of a human fetal brain Lambda ZAP cDNA 

25 library (Stratagene). A clone with an insert of 2.6-kb was isolated, and the 

complete DNA sequence of this clone was determined and shown to have a 
single open reading frame of 1950-bp that is predicted to encode a 650-amino 
acids protein with a calculated molecular mass of 74.7 kDa. A comparision of 
the DNA sequences of the EST insert to the fetal brain cDNA revealed a 72 

30 base pair insert in the fetal brain sequence. The insert is in the DH donaain. 
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As discussed more in the Examples, Northern analysis revealed a 3.5 kb 
transcript in brain tissue and a 4 kb transcript in liver. Consequently^ using 
the additional sequence identified from the 2.6-kb sequence that was not 
present in EST #167059 we identified another EST (#109922) that had been 
5 isolated from a human cDNA liver library. The plasmid containing this insert 

was also obtained, and the insert sequenced which revealed an initiating 
methionine. 

Fig. 1 (SEQ ID NO(s): 1 and 2) show the alignment of the full length 
liver nucleotide cDNA sequence, with its deduced amino acid sequence, 

10 respectively. It is note worthy that this sequence has an additional 126 amino 

acids which differ from the amino-terminal 66 amino acids of the 2.6 kb brain 
cloned (Fig. 2). Also shown in the figure are various domains, including the 
Src homology 3, Dbl homology and pleckstrin homology domains. It, or its 
corresponding gene, can be isolated from natural sources. Characterization of 

15 a human Rac-GEF is described below and in the examples. 

It is noteworthy that because of the protein-protein interactive 
properties of the Src homology 3 domain, ligands that bind to this domain 
may be identified, for example, by screening an expression library, that affect 
Rac-GEF activity. Such ligands would have medical applications. 

20 The present invention also relates to polypeptide fragments of Rac- 

GEF. The fragments are preferably biologically-active. By biologically-active, 
it is meant that the polypeptide fragment possesses an activity in a living 
system or with components of a living system. Biological-activities include: a 
specific binding affinity for a guanine nucleotide-depleted state of G-proteins, 

25 in particular Rac, a guanine nucleotide exchange activity, an oncogeruc 

transforming activity, an immunogenic activity, modulating the binding 
between a Rac-GEF and a Rac GTPase, or acting as an agonist or antagonist of 
Rac GTPase activity. Such activities can be assayed routinely, e.g., according to 
the methods described above and below. Various fragments can be prepared. 

30 See the examples below for further discussion. Fragments can also be selected 

in which one or more of the mentioned activities are eliminated or altered 
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when compared to Rac-GEF. As described in tiie examples, such fragments 
can be prepared routinely, e.g., by recombinant means or by proteolytic 
cleavage of isolated polypeptides, and then assayed for a desired activity. 

The present invention also relates to a human Rac-GEF specific amino 
5 acid sequence as set forth in Fig. 1 (SEQ ID NO: 2): A clone encoding such 

sequence, 128 to 711 amino acids and also containing 66 divergent amino 
acids as shown in Figure 2, has been deposited on December 11, 1996 with the 
American Type Culture Collection with Accession No. 98273. A Rac-GEF 
specific amino acid sequence means a defined amino acid sequence. A specific 

10 amino acid sequence can be found routinely, e.g., by searching a gene/protein 

database using the BLAST set of computer programs. A Rac-GEF specific 
amino acid sequence can be useful to produce peptides as antigens to generate 
an immune response specific for Rac-GEF. Antibodies obtained by such 
immunization can be used as a specific probe for the Rac-GEF protein for 

15 diagnostic or research purposes. Such peptides can also be used to inhibit the 

Rac-GEF binding to Rac to modulate pathological conditions in cells. 

A polypeptide of the invention, e.g., having a polypeptide sequence as 
shown in Fig. 1 (SEQ ID NO: 2), can by analyzed by available methods to 
identify structural and/or functional domains in the polypeptide. For 

20 example, when the polypeptide coding sequence set forth in Fig. 1 (SEQ ID 

NO:2) is analyzed by computer algorithms, a continuous coding sequence 
comprising the following domains is identified: Src homology, Dbl homology 
and pleckstrin homology domains. Various programs can be employed to 
analyze structure of the polypeptide, including, EMBL Protein Predict; Rost 

25 and Sander, Proteins, 19:55-72, 1994; Kyte and Doolittle, J. Mol. Bio.: 157:105, 

1982. 

A polypeptide of the present invention can also have 100% or less 
amino acid sequence identity to the amino acid sequence set forth in Fig 1, 
(SEQ ID NO: 2). For the purposes of the following discussion: Sequence 
30 identity means that the same nucleotide or amino acid which is found in the 

sequence set forth in Fig 1. (SEQ ID NO: 1 and SEQ ID NO: 2) is found at the 
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corresponding position of the compared sequence(s). A polypeptide having 
less than 100% sequence identify to the amino acid sequence set forth in Fig. 1 
(SEQ. ID NO: 2) can be substihited in various ways, e.g., by a conservative 
amino acid. See below for examples of conservative amino acid substitution. 
The sum of the identical and conserved residues divided by the total number 
of residues in the sequence over which the Rac-GEF polypeptide is compared 
is equal to the percent sequence similarity. For purposes of calculating 
sequence identity and similarity, the compared sequences can be aligned and 
calculated according to any desired method, algorithm, computer program, 
etc., including, e.g., FASTA, BLASTA. A polypeptide having less than 100% 
amino acid sequence identity to the amino acid sequence of Fig. 1 (SEQ ID 
NO: 2) can comprise e.g., about 60, 65, more preferably, 67, 70, 78, 80, 90, 92, 
96, 99, etc. 

A Rac GEF polypeptide, fragment, or substituted GEF polypeptide can 

15 also comprise various modifications, where such modifications include 

glycosylation, covalent modifications (e.g., of an R-group of an amino acid), 

amino acid substitution, amino acid deletion, or amino acid addiHon. 

Modifications to the polypeptide can be accomplished according to various 

methods, including recombinant, synthetic, chemical, etc. 

A mutation to a Rac-GEF polypeptide can be selected to have a 

biological activity of Rac-GEF, e.g., a specific binding affinity for a guanine 

nucleotide-depleted state of G-proteins, in particular Rac, a guanine nucleotide 

exchange activity, an oncogenic transforming activity, and an immunogenic 

activity. The selection and preparation of mutations of Rac-GEF is discussed 
25 below. 

Polypeptides of the present invention (e.g., Rac-GEF, fragments 
thereto, mutations thereof) can be used in various ways, e.g., as immunogens 
for antibodies as described below, as biologically-active agents (e.g., having 
one or more of the activities associated with Rac-GEF), as inhibitors of Rac- 
GEF. For example, upon binding of Rac-GEF to Rac, a cascade of events is 
initiated in the cell, e.g., promoting cell proliferation and/or cytoskeletal 
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rearrangements. The interaction between Rac-GEF and Rac can be modulated 
by using a peptide fragment of Rac-GEF, e.g., a peptide fragment which is an 
inhibitor at the site where Rac-GEF interacts (e.g., binds) to Rac. Such a 
fragment can be useful for modulating pathological conditions associated with 
5 the Rac signaling pathway. A useful fragment can be identified routinely by 

testing the ability of overlapping fragments of the entire length of Rac-GEF to 
inhibit a Rac-GEF activity, such as guanine nucleotide exchange activity, 
binding to a guanine nucleotide depleted state of Rac. and oncogenic 
transforming activity. The measurement of certain of these activities is described 
10 below, and in the examples. These peptides can also be identified and 

prepared as described in EP 496 162. Peptides can be chemically-modified, 
etc. 

A polypeptide coding for a Rac-GEF polypeptide, or a derivative or 
fragment thereof, can be combined with one or more structural domains, 

15 functional domains, detectable domains, antigenic domains, and /or a desired 

polypeptides of interest, in an arrangement which does not occur in nature, 
i.e., not naturally-occurring, e.g., as in a normal Rac-GEF gene, a genomic 
fragment prepared from the genome of a living organism, e.g., an animal, 
preferably a mammal, such as human, mouse, or cell lines thereof. A 

20 polypeptide comprising such features is a chimeric or fusion polypeptide. 

Such a chimeric polypeptide can be prepared according to various methods, 
including, chemical, synthetic, quasi-synthetic, and /or recombinant methods. 
A chimeric nucleic acid coding for a chimeric polypeptide can contain the 
various domains or desired polypeptides in a continuous or interrupted open 

25 reading frame, e.g., containing introns, splice sites, enhancers, etc. The - 

chimeric nucleic acid can be produced according to various methods. See, e.g., 
U.S. Pat. No. 5,439,819. A domain or desired polypeptide can possess any 
desired property, including, a biological function such as catalytic, signalling, 
growth promoting, cellular targeting, etc., a structural function such as 

30 hydrophobic, hydrophilic, membrane-spanning, etc., receptor-ligand 

functions, and/or detectable functions, e.g., combined with enzyme, 

9 
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fluorescent polypeptide, green fluorescent protein GFP (Chalfie et al., 1994, 
Science, 263:802; Cheng etal., 1996, Nature Biotechnology, 14:606; Levy et al., 
1996, Nature Biotechnology, 14:610, etc. In addition, a Rac-GEF nucleic acid, or 
a part of it, can be used as selectable marker when introduced into a host cell. 
For example, a nucleic acid coding for an amino acid sequence according to 
the present invention can be fused in-frame to a desired coding sequence and 
act as a tag for purification, selection, or marking purposes. The region of 
fusion encodes a cleavage site. 

A polypeptide according to the present invention can be produced in an 
expression system, e.g., in vivo, in vitro, cell-free, recombinant, cell fusion, 
etc., according to the present invention. Modifications to the polypeptide 
imparted by such system include, glycosyiation, amino acid substituHon (e.g., 
by differing codon usage), polypeptide processing such as digestion, cleavage, 
endopeptidase or exopeptidase activity, attachment of chemical moieties, 
including lipids, phosphates, etc. For example, some cell lines can remove the 
terminal methionine from an expressed polypeptide. 

A polypeptide according to the present invention can be recovered from 
nahiral sources, transformed host cells (cultiare medium or cells) according to 
the usual methods, including, ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, hydroxyapatite 
chromatography and lectin chromatography. It may be useful to have low 
concentirations (approximately 0.1-5 mM) of calcium ion present during 
purification (Price, et al., /. Biol. Chem., 244:917 (1969)). Protein refolding steps 
can be used, as necessary, in completing the configuration of the mature 
protein. Finally, high performance liquid chromatography (HPLC) can be 
employed for final purification steps. 

In accordance with the present invention, a nucleic acid coding for a 
Rac-GEF can comprise, e.g., the complete coding sequence as set forth in Fig. 1 
(SEQ ID NO: 1). A nucleic acid according to the present invention can also 
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comprise a nucleotide sequence which is 100% complementary, e.g., an anti- 
sense, to any nucleotide sequence mentioned above and below. 

A Rac GEF encoding nucleic acid according to the present invention can 
be obtained from a variety of different sources. It can be obtained from DNA 
or RNA, such as polyadenylated mRNA, e.g., isolated from tissues, cells, or 
whole organism. The nucleic acid can be obtained directly from DNA or 
RNA, or from a cDNA library. The nucleic acid can be obtained from a cell at 
a particular stage of development, having a desired genotype, phenotype (e.g., 
an oncogenically transformed cell or a cancerous cell), etc. 

A nucleic acid comprising a nucleotide sequence coding for a 
polypeptide according to the present invention can include only coding 
sequence of Rac-GEF; coding sequence of Rac-GEF and additional coding 
sequence (e.g., sequences coding for leader, secretory, targeting, enzymatic, 
fluorescent or other diagnostic peptides), coding sequence of Rac-GEF and 
15 non-coding sequences, e.g., untranslated sequences at either a 5' or 3' end, or 

dispersed in the coding sequence, e.g., introns. A nucleic acid comprising a 
nucleotide sequence coding without interruption for a Rac-GEF polypeptide 
nieans that the nucleotide sequence contains an amino acid coding sequence 
for a Rac-GEF polypeptide, with no non-coding nucleotides interrupting or 
intervening in the coding sequence, e.g., absent intron(s). Such a nucleotide 
sequence can also be described as contiguous. 

A nucleic acid according to the present invention also can comprise an 
expression control sequence operably lii\ked to a nucleic acid as described 
above. The phrase "expression control sequence" means a nucleic acid 
25 sequence which regulates expression of a polypeptide coded for by a nucleic 

acid to which it is operably linked. Expression can be regulated at the level of 
the mRNA or polypeptide. Thus, the expression control sequence includes 
mRNA-related elements and protein-related elements. Such elements include 
promoters, erihancers (viral or cellular), ribosome binding sequences, 
transcriptional terminators, etc. An expression control sequence is operably 
linked to a nucleotide coding sequence when the expression control sequence 
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is positioned in such a manner to effect or achieve expression of the coding 
sequence. For example, when a promoter is operably linked 5' to a coding 
sequence, expression of the coding sequence is driven by the promoter. 
Expression control sequences can be heterologous or endogenous to the 
normal gene. 

A nucleic acid in accordance with the present invention can be selected 
on the basis of nucleic acid hybridization. The ability of tu'o single-stranded 
nucleic acid preparations to hybridize together is a measure of their nucleotide 
sequence complementarity, e.g., base-pairing between nucleotides, such as A- 
T, G-C, etc. The invention thus also relates to nucleic acids which hybridize to 
a nucleic acid comprising a nucleotide sequence as set forth in Fig. 1 (SEQ ID 
NO: 1). A nucleotide sequence hybridizing to the latter sequence will have a 
complementary nucleic acid strand, or act as a template for one in the presence 
of a polymerase (i.e., an appropriate nucleic acid synthesizing enzyme). The 
present invention includes both strands of nucleic acid, e.g., a sense strand and 
an anti-sense strand. 

Hybridization conditions can be chosen to select nucleic acids which 
have a desired amount of nucleotide complementarity with the nucleotide 
sequence set forth in Fig. 1 (SEQ ID NO: 1). A nucleic add capable of 
hybridizing to such sequence, preferably, possesses 50%, more preferably, 70% 
complementarity, between the sequences. The present invention particularly 
relates to DNA sequences which hybridize to the nucleotide sequence set forth 
in Fig. 1 (SEQ ID NO: 1) under sh-ingent conditions. As used here, "stiringent 
conditions" means any conditions in which hybridization will occur where 
there is at least about 95%, preferably 97%, nucleotide complementarity 
between the nucleic acids. Such conditions include, e.g., hybridization for 
Northern: 5X SSPE, lOX Denhardts solution, 100 pg/ml freshly denatiired and 
sheared salmon sperm DNA, 50% formamide, 2% SDS at ^2°C; hybridization 
for cloning from cDNA hbrary: IX PAM, 0.1% SDS, 50% formamide at 420C. 

According to the present invention, a nucleic acid or polypeptide can 
comprise one or more differences in the nucleotide or amino acid sequence set 
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forth in Fig. 1 (SEQ ID NO: 1 and SEQ ID NO: 2). Changes or modifications to 
the nucleotide and/or amino acid sequence can be accomplished by any 
method available, including directed or random mutagenesis. 

A nucleic acid coding for a Rac-GEF according to the invention can 
comprise nucleotides which occur in a naturally-occurring Rac-GEF gene e.g., 
naturally-occurring polymorphisms, normal or mutant alleles (nucleotide or 
amino acid), mutations which are discovered in a natural population of 
mammals, such as humans, monkeys, pigs, mice, rats, or rabbits. By the term 
naturally-occurring, it is meant that the nucleic acid is obtained from a natural 
source, e.g., animal tissue and cells, body fluids, tissue culture cells, forensic 
samples. Naturally-occurring mutations to Rac-GEF can include deletions 
(e.g., a truncated amino- or carboxy-terminus), substihitions, or additions of 
nucleotide sequence. These genes can be detected and isolated by nucleic acid 
hybridization according to methods which one skilled in the art would know. 
It is recognized that, in analogy to other oncogenes, naturally-occurring 
variants of Rac-GEF include deletions, substitutions, and additions which 
produce pathological conditions in the host cell and orgarusm. 

A nucleotide sequence coding for a Rac-GEF polypeptide of the 
invention can contain codons found in a naturally-occurring gene, transcript, 
or cDNA, for example, e.g., as set forth in Fig. 1 (SEQ ID NO: 1), or it can 
contain degenerate codons coding for the same anuno acid sequences. 

In addition, a nucleic acid or polypeptide of the present invention can 
be obtained from any desired mammalian organism, but also non-mammalian 
organisms. Homologs from mammalian and non-mammalian organisms can 
be obtained according to various methods. For example, hybridization with 
an appropriate oligonucleotide selective for Rac-GEF can be employed to 
select such homologs, e.g., as described in Sambrook et al.. Molecular Cloning, 
1989, Chapter 11. 

Such homologs can have varying amounts of nucleotide and amino acid 
sequence identity and similarity to Rac-GEF. Non-mammalian organisms 
include, e.g., vertebrates, invertebrates, chicken, Drosophila, yeasts (such as 
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Saccharomxjces cerevisiae), C. elegans, roundworms, prokaryotes, plants, 
Arabidopsis, viruses, etc. 

Modifications to a Rac-GEF sequence, e.g., mutations, can also be 
prepared based on homology searching from gene data banks, e.g., Genbank, 
EMBL. Sequence homology searching can be accomplished using various 
methods, including algorithms described in the BLAST family of computer 
programs, the Smith-Waterman algorithm, etc. For example, conserved amino 
acids can be identified between various sequences, Dbl, Ibc, Ost, Isc, CDC24, 
etc. See, e.g., Touhara et al., J. Biol. Chem., 269:10217-10220, 1994; Toksoz and 
Williams, Oncogene, 9:621-628, 1994; Whitehead et al., J. Biol. Chem., 
271:18643-18650, 1996. A mutation(s) can then be introduced into a Rac-GEF 
sequence by identifying and aligning amino acids conserved between the 
polypeptides and then modifying an amino acid in a conserved or non- 
conserved position. A mutated Rac-GEF gene can comprise conserved or 
nonconserved amino acids, e.g., between corresponding regions of 
homologous nucleic acids, especially between Dbl homology (DH) domains, 
etc. For example, a mutated sequence can comprise conserved or non- 
conserved residues from any number of homologous sequences as mentioned- 
above and/or determined from an appropriate searching algorithm. 

Mutations can be made in specific regions of nucleic acid coding for the 
Rac-GEF polypeptide, e.g., in the Dbl homology domain, such as replacing it, 
changing amino acid sequences within it, etc., to analyze a function (e.g., 
oncogenic transformation, binding to a G-protein, guanine nucleotide 
exchange) of the polypeptide coded for by the nucleic acid. For example, 
deletion of the plecksh:in domain would result in the loss of oncogenic 
transforming activity. The plecksti-in domain can also be involved with lipid 
(e.g., phosphoinositides) binding, binding to Rac, activation of the guanine 
nucleotide exchange activity, and localization of the polypeptide in the cell. 
Thus, this region can be mutagenized according to various methods and then 
assayed for loss or gain of the mentioned functions. The DH domain is 
involved with promoting GDP dissociation from the Rac GTPase. Thus, 
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substitutions or deletions within this region can be prepared and assayed 
routinely for loss or gain of function. A mutation can be made in these or 
other regions of Rac-GEF which affect its phosphorylation or protein/lipid 
interaction leading to its modulation of the growth signaling pathway. Such a 
5 mutated gene can be useful in various ways: for diagnosis in patients having 

such a mutation, to introduce into cells or animals (transgenic) as a model for a 
pathological condition. Mutations which affect both GEF activity and 
transforming activity can be analogous to those made in the DH domain of the 
Dbl oncogene as described in Hart et al., T. Biol. Chem.. 269:62-65. 

1 0 A nucleic acid and corresponding polypeptide of the present invention 

include sequences which differ from the nucleotide sequence of Fig. 1 (SEQ ID 
NO: 1) but which are phenotypically silent. These sequence modifications 
include, e.g., nucleotide substihition which do not affect the amino acid 
sequence (e.g., different codons for the same amino acid), replacing naturally- 

15 occurring amino acids with homologous or conservative an ino acids, e.g., 

(based on the size of the side chain and degree of polarization) small nonpolar: 
cysteine, proline, alanine, threonine; small polar: serine, glycine, aspartate, 
asparagine; large polar: glutamate, glutamine, lysine, arginine; intermediate 
polarity: tyrosine, histidine, tryptophan; large nonpolar: phenylalanine, 

20 methiorune, leucine, isoleucine, valine. Such conservative substitutions also 

include those described by Dayhoff in the Atlas of Protein Sequence and 
Structure 5 (1978), and by Argos in EMBO T.. 8, 779-785 (1989). 

A nucleic acid can comprise a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in Fig 1. (SEQ ID NO: 

25 2) except where one or more positions are substituted by conservative amino 

acids; or a nucleotide sequence coding for a polypeptide having an amino acid 
sequence as set forth in Fig 1. (SEQ ID NO:2). The invention also relates to 
polypeptides coded for by such nucleic acids. In addition, it may be desired to 
change the codons in the sequence to optimize the sequence for expression in a 

30 desired host. 
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A nucleic acid according to the present invention can comprise, e.g., 
DNA, RNA, synthetic nucleic add, peptide nucleic acid, modified nucleotides, 
or mixhires. A DNA can be double- or single-stranded. Nucleotides 
comprising a nucleic acid can be joined via various known linkages, e.g., ester, 
suifamate, sulfamide, phosphorothioate, phosphoramidate, 
methylphosphonate, carbamate, etc., depending on the desired purpose, e.g., 
resistance to nucleases, such as RNase H, improved in vivo stability, etc. See, 
e.g., U.S. Pat. Nos. 5,378,825. 

Various modifications can be made to the nucleic acids, such as 
attaching detectable markers (avidin, biotin, radioactive elements), moieties 
which improve hybridization, detection, or stability. The nucleic acids can 
also be attached to solid supports, e.g., nitrocellulose, nylon, agarose, 
diazotized cellulose, latex solid microspheres, polyacrylamides, etc., according 
to a desired method. See, e.g., U.S. Pat. Nos. 5,470,967, 5,476,925, 5,478,893. 

Another aspect of the present invention relates to oligonucleotides and 
nucleic acid probes. Such oligonucleotides or nucleic acid probes can be used, 
e.g., to detect, quantitate, or isolate a Rac-GEF nucleic acid in a test sample. 
Detection can be desirable for a variety of different purposes, including 
research, diagnostic, and forensic. For diagnostic purposes, it may be 
desirable to identify the presence or quantity of a Rac-GEF nucleic acid 
sequence in a sample, where the sample is obtained from tissue, cells, body 
fluids, etc. In a preferred method, the present invention relates to a method of 
detecting a Rac-GEF nucleic acid comprising, contacting a target nucleic acid 
in a test sample with an oligonucleotide under conditions effective to achieve 
hybridization between the target and oligonucleotide; and detecting 
hybridization. An oligonucleotide in accordance with the invention can also 
be used in synthetic nucleic acid amplification such as PGR, e.g., Saiki et al., 
1988, Science, 241:53; U.S. Pat. No. 4,683,202. 

Another aspect of the present invention is a nucleotide sequence which 
is unique to Rac-GEF. By a unique sequence to Rac-GEF, it is meant a 
defined order of nucleotides which occurs in Rac-GEF, e.g., in the nucleotide 
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sequence of Fig. 1 (SEQ ID NO: 1), but rarely or infrequently in other nucleic 
acids, especially not in an animal nucleic acid, preferably mammal, such as 
human, rat, mouse, etc. Both sense and antisense nucleotide sequences are 
included. A unique nucleic acid according to the present invention can be 
determined routinely. A nucleic acid comprising a unique sequence of Rac- 
GEF can be used as a hybridization probe to identify the presence of Rac-GEF 
in a sample comprising a mixture of nucleic acids, e.g., on a Northern blot. 
Hybridization can be performed under stringent conditions to select nucleic 
acids having at least 95% identity (i.e., complementarity) to the probe, but less 
stringent conditions can also be used. A unique Rac-GEF nucleotide sequence 
can also be fused in-frame, at either its 5' or 3' end, to various nucleotide 
sequences as mentioned throughout the patent, including coding sequences for 
other parts of Rac-GEF, enzymes, GFP, etc, expression control sequences, etc. 

Hybridization can be performed under different conditions, depending 
01 ■ the desired selectivity, e.g., as described in Sambrook et al.. Molecular 
Cloning, 1989. For example, to specifically detect Rac-GEF, an oligonucleotide 
can be hybridized to a target nucleic acid under conditions in which the 
oligonucleotide only hybridizes to Rac-GEF, e.g., where the oligonucleotide is 
100% complementary to the target. Different conditions can be used if it is 
desired to select target nucleic acids which have less than 100% nucleotide 
complementarity, at least about, e.g., 99%, 97%, 95%, 90%, 70%, 67%. Since a 
mutation in a Rac-GEF gene can cause diseases or pathological conditions, 
e.g., cancer, benign tumors, an oligonucleotide according to the present 
invention can be used diagnostically. For example, a patient having 
symptoms of a cancer or other condition associated with the Rac signaling 
pathway (see below) can be diagnosed with the disease by using an 
oligonucleotide according to the present invention, in polymerase chain 
reaction followed by DNA sequencing to identify whether the sequence is 
normal, in combination with other oncogene oligonucleotides, etc., e.g., p53, 
Rb, p21, Dbl, MTSl, Wtl, Bcl-1, Bcl-2, MDM2, etc. In a preferred method, the 
present invention relates to a method of diagnosing a cancer comprising 
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contacting a sample con^prising a target nucleic acid with an oligonucleotide 
under conditions effective to permit hybridization between the target and 
oligonucleotide; detecting hybridization, wherein the oligonucleotide 
comprises a sequence of Rac-GEF, preferably a unique sequence of Rac-GEF; 
and determining the nucleotide sequence of the target nucleic acid to which 
the oligonucleotide is hybridized. The sequence can be determined according 
to various methods, including isolating the target nucleic acid, or a cDNA 
thereof, and determining its sequence according to a desired method. 

Oligonucleotides according to the present invention can be of any 
desired size, preferably 14-16 oligonucleotides in length, or more. Such 
oligonucleotides can have non-naturally-occurring nucleotides, e.g., inosine. 
In accordance with the present invention, the oligonucleotide can comprise a 
kit, where the kit includes a desired buffer (e.g., phosphate, h-is, etc.), 
detection compositions, etc. The oligonucleotide can be labeled or unlabeled, 
with radioactive or non-radioactive labels as known in the art. 

Anti-sense nucleic acid can also be prepared from a nucleic acid 
according to the present, preferably an anti-sense to a coding sequence of Fig. 
1 (SEQ ID NO: 1). Antisense nucleic acid can be used in various ways, such as 
to regulate or modulate expression of Rac-GEF, e.g., inhibit it, to detect its 
expression, or for in sitii hybridization. For the purposes of regulating or 
modulating expression of Rac-GEF, an anti-sense oligonucleotide can be 
operably linked to an expression control sequence. 

The nucleic acid according to the present invention can be labelled 
according to any desired method. The nucleic acid can be labeled using 
radioactive hracers such as 32p, SSg, 125i^ 3h, or l^c, to mention only the most 
commonly used tiracers. The radioactive labelling can be carried out according 
to any method such as, for example, terminal labeling at the 3' or 5' end using 
a radiolabeled nucleotide, polynucleotide kinase (with or without 
dephosphorylation with a phosphatase) or a ligase (depending on the end to 
be labelled). A non-radioactive labeling can also be used, combining a nucleic 
acid of the present invention with residues having immunological properties 
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(antigens, haptens), a specific affinity for certain reagents (ligands), properties 
enabling detectable enzyme reactions to be completed (enzymes or coenzymes, 
enzyme substrates, or other substances involved in an enzymatic reaction), or 
characteristic physical properties, such as fluorescence or the emission or 
5 absorption of light at a desired wavelength, etc. 

A nucleic acid according to the present invention, including 
oligonucleotides, anti-sense nucleic acid, etc., can be used to detect expression 
of Rac-GEF in whole organs, tissues, cells, etc., by various techniques, 
including Northern blot, PGR, in situ hybridization, etc. Such nucleic acids 

10 can be particularly useful to detect disturbed expression, e.g., cell-specific 

and/or subcellular alterations, of Rac-GEF. The levels of Rac-GEF can be 
determined alone or in combination with other genes products (oncogenes 
such as p53, Rb, Wtl, etc.), transcripts, etc. 

A nucleic acid according to the present invention can be expressed in a 

15 variety of different systems, in vitro and in vivo, according to the desired 

purpose. For example, a nucleic acid can be inserted into an expression vector, 
introduced into a desired host, and cultured under conditions effective to 
achieve expression of a polypeptide coded for the nucleic acid. Effective 
conditions includes any culture conditions which are suitable for achieving 

20 production of the polypeptide by the host cell, including effective 

temperatures, pH, medias, additives to the media in which the host ceil is 
cultured (e.g., additives which amplify or induce expression such as butyrate, 
or methotrexate if the coding nucleic acid is adjacent to a dhfr gene), 
cyclohexamide, cell densities, culture dishes, etc. A nucleic acid can be 

25 introduced into the cell by any effective method including, e.g., calcium 

phosphate precipitation, electroporation, injection, DEAE-Dextran mediated 
transfection, fusion with liposomes, and viral transfection. A cell into which a 
nucleic acid of the present invention has been introduced is a transformed host 
cell. The nucleic acid can be extrachromosomal or integrated into a 

JO chromosome(s) of the host cell. It can be stable or transient. An expression 

vector is selected for its compatibility with the host cell. Host cells include, 
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mammalian cells, e.g., COS-7, CHO, HeLa, LTK, NIH 3T3, Rat 1 fibroblasts, 
yeast, insect cells, such as Sf9 (S. frugipeda) and Drosophila, bacteria, such as 
E. coli, Streptococcus, bacillus, yeast, fungal cells, plants, embryonic stem cells 
(e.g., mammalian, such as mouse or human), cancer or tumor cells. Sf9 
expression can be accomplished in analogy to Graziani et al.. Oncogene, 7:119- 
235, 1992. Expression control sequences are similarly selected for host 
compatibility and a desired purpose, e.g., high copy number, high amounts, 
induction, amplification, controlled expression. Other sequences which can be 
employed include enhancers such as from SV40, CMV, inducible promoters, 
cell-type specific elements, or sequences which allow selective or specific cell 
expression. 

In addition to a Rac-GEF nucleic acid, another gene of interest can be 
introduced into the same host for purposes of, e.g., modulating expression 
Rac-GEF, elucidating Rac-GEF function or that of the gene of interest. Genes 
of interest include other oncogenes, genes involved in the cell cycle, etc. Such 
genes can be the normal gene, or a variation, e.g., a mutation, chimera, 
polymorphism, etc. 

A nucleic acid or polypeptide of the present invention can be used as a 
size marker in nucleic acid or protein electrophoresis, chromatography, etc. 
Defined restriction fragments can be determined by scanning the sequence for 
restriction sites, calculating the size, and performing the corresponding 
restriction digest. For example, the Rac-GEF polypeptide from fetal brain can 
also be used as a molecular weight marker of about 74.7 kDa for a protein gel. 

Another aspect of the present invention relates to the regulation of 
biological pathways in which a GTPase is involved, particularly pathological 
conditions, e.g., cell proliferation (e.g., cancer), growth control, 
morphogenesis, stress fiber formation, and integrin-mediated interactions, 
such as embryonic development, tumor cell growth and metastasis, 
programmed cell death, hemostasis, leucocyte homing and activation, bone 
resorption, clot retraction, and the response of cells to mechanical stress. See, 
e.g., Clark and Brugge, Science, 268:233- 239, 1995; Bussey, Science, 272:225- 
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226, 1996. Thus, the invention relates to all aspects of a method of modulating 
an activity of a Rac polypeptide comprising, administering an effective 
amount of a Rac-GEF polypeptide or a biologically-active fragment thereof, 
an effective amount of a compound which modulates the activity of a Rac 
polypeptide, or an effective amount of a nucleic acid which codes for a Rac- 
GEF polypeptide or a biologically-active fragment thereof. The activity of Rac 
which is modulated can include: GTP binding, GDP binding, GTPase activity, 
integrin binding, coupling or binding of Rac to receptor or effector-like 
molecules (such as integrins, growth factor receptors, tyrosine kinases, PI-3K, 
PIP-5K, etc.). See, e.g., Clark and Brugge, Science, 268:233-239, 1995. The 
activity can be modulated by increasing, reducing, antagonizing, promoting, 
etc. of Rac. The modulation of Rac can be measured by assav for GTP 
hydrolysis, binding to Rac-GEF, etc. An effective amount is any amount 
which, when administered, modulates the Rac activity. The activity can be 
modulated in a cell, a tissue, a whole organism, in situ, in vitro (test tube, a 
solid support, etc.), in vivo, or in any desired environment. 

Compounds that regulate the interaction between a GEF, such Rac- 
GEF, and a GTPase can be identified using an assay for a GEF activity, such as 
guanine nucleotide exchange activity, binding to a guanine nudeotide- 
depleted site of a GTPase, or oncogenic ti-ansforming activity, or a GTPase 
activity such as GTP hydrolysis. In general, a compound having such an in 
vitro activity will be useful in vivo to modulate a biological patiiway associated 
with a GTPase, e.g., to h:eat a pathological condition associated with the 
biological and cellular activities mentioned above. By way of illusti-ation, the 
ways in which GEF regulators can be identified are described above and 
below in terms of Rac and Rac-GEF. However, it is to be understood that 
such methods can be applied generally to other GEFs. 

A guanine nucleotide exchange assay, e.g., as described in Hart et al.. 
Nature, 354:311-314, 28 Nov. 1991 (see, especially. Figure 2 legend therein), can 
be used to assay for the ability of a compound to regulate the interaction 
between Rac and Rac-GEF. For example, Rac protein (recombinant, 
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recombinant fusion protein, or isolated from natural sources) is labeled with 
tritiated-GDP. The tritiated-GDP-labeled Rac is then incubated with Rac-GEF 
and GTP under conditions in which nucleotide exchange occurs. The amount 
of tritiated-GDP that is retained by Rac is determined by separating bound 
GDP from free GDP, e.g., using a BA85 filter. The ability of a compound to 
regulate the interaction can be determined by adding the compound at a 
desired time to the incubation (e.g., before addition of a Rac-GEF, after 
addition of a Rac-GEF) and determining its effect on nucleotide exchange. 
Various agonist and antagonists of the interaction can be identified in this 
manner. For instance, an aspect of the instant invention is the discovery that 
certain compounds greatly enhance the activity of Rac-GEFs, and preferrably 
of the Rac-GEF, Tiam-1. Such compounds are hereinafter termed "GEF 
enhancers." Such compounds have certain similar chemical features including 
a hydrocarbon arm, preferrably consisting of substantially saharated bonds 
15 that link the carbon residues together, and also preferrably the number of 

carbon atoms should be between 12-22. A second feature of such compounds 
is the association of the hydrocarbon arm to either a 5 or 6 membered ring 
structure. Preferred 5 or 6 membered compounds include ascorbate and 
certain cydohexanes, respectively. The more preferred 5 membered 
compounds are derivatives of ascorbate, while the more preferred 
cydohexanes include insoitol. 

Binding to a guanine nudeotide-depleted site of Rac can be determined 
in various ways, e.g., as described in Hart et al., J. Biol. Chem., 269:62-65, 1994. 
Briefly, a Rac protein can be coupled to a solid support using various methods 
25 that one skilled in the art would know, e.g., using an antibody to Rac, a fusion 

protein between Rac and a marker protein, such as glutathione protein (GST), 
wherein the fusion is coupled to a solid support via the marker protein (such 
as glutathionine beads when GST is used), etc. The Rac protein is converted to 
a guanine nucleotide depleted state (for effective conditions, see, e.g.. Hart et 
ai., J. Biol. Chem., 269:62-65, 1994) and incubated witii, e.g., GDP, GTPyS, and 
a GEF such as Rac-GEF. The solid support is then separated and any protein 
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on it run on a gel. A compound can be added at any time during the 
incubation (as described above) to determine its effect on the binding of the 
GEF to Rac. 

The modulation of oncogenic transforming activity by a Rac-GEF, or 
5 derivatives thereof, can be measured according to various known procedures, 

e.g., Eva and Aaronson, Nature, 316:273-275, 1985; Hart et al., J. Biol. Chem., 
269:62-65, 1994, A compound can be added at any time during the method 
(e.g., pretreatment of cells; after addition of GEF, etc) to determine its effect 
on the oncogenic transforming activity of Rac-GEF. Various cell lines can also 
10 be used. 

Other assays for Rac-mediated signal transduction can be accomplished 
according to procedures known in the art, e.g., as described in U.S. Pat. Nos. 
5,141,851; 5,420,334; 5,436,128; and 5,482,954; W094/16069; W093/16179; 
W091/15582; WO90/00607. In addition, peptides which inhibit the 

15 interaction, e.g., binding, between Rac-GEF and a G-protein, such as Rac, can 

be identified and prepared according to EP 496 162. 

The present invention also relates to a method of testing for and 
identifying an agent which modulates the guanine nucleotide exchange 
activity of a guanine nucleotide exchange factor, or a biologically-active 

20 fragment thereof, or which modulates the binding between a Rac-GEF, or a 

biologically-active fragment thereof, and a GTPase, or a biologically-active 
fragment thereof, to which it binds. The method comprises contacting the GEF 
and GTPase with an agent to be tested and then detecting the presence or 
amount of binding between the GEF and GTPase, or an activity of the GEF 

15 such as guanine nucleotide exchange activity. By modulating, it is meant that 

addition of the agent affects tiie activity or binding. The binding or activity 
modulation can be affected in various ways, including inhibiting, blocking, 
preventing, increasing, enhancing, or promoting it. The binding or activity 
affect does not have to be achieved in a specific way, e.g., it can be 

50 competitive, noncompetitive, allosteric, sterically hindered, via cross-linking 

between the agent and the GEF or GTPase, etc. The agent can act on either the 
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GEF or GTPase. The agent can be an agonist, an antagonist or a partial 
agonist or antagonist. The presence or amount of binding can be determined 
in various ways, e.g., directly or indirectly by assaying for an activity 
promoted or inhibited by the GEF, such as guanine nucleotide exchange, GTP 
5 hydrolysis, oncogenic transformation, etc. Such assays are described above 

and below, and are also known in the art. The agent can be obtained and/or 
prepared from a variety of sources, including natural and synthetic. It can 
comprise, e.g., amino acids, lipids, carbohydrates, organic molecules, nucleic 
acids, inorganic molecules, or mixtures thereof. See, e.g., Hoeprich, Nature 

10 Biotechnology, 14:1311-1312, 1996, which describes an example of automated 

synthesis of organic molecules. The agent can be added simultaneously or 
sequentially. For example, the agent can be added to the GEF and then the 
resultant mixture can be further combined with the GTPase. The method can 
be carried out in liquid on isolated components, on a matrix (e.g., filter paper, 

15 nitrocellulose, agarose), in cells, on tissue sections, etc. In accordance with the 

method, a GEF can bind to the GTPase, which binding will modulate some 
GTPase activity. For example, as discussed above and below, a Rac-GEF binds 
to Rac, causing guanine nucleotide dissociation. The effect can be directiy on 
the binding site between the GEF and GTPase, or it can be allosteric, or it can 

20 be on only one component (e.g., on the GEF only). Assays for guanine 

nucleotide dissociation can be readily adapted to identify agents which 
regulate the activity of a GTPase. The method further relates to obtaining or 
producing agents which have been identified according to the above- 
described method. 

25 The present invention also relates to products identified in accordance 

with such methods. Various GEFs and GTPases can be employed, including, - 
Rac-GEF, mSOS, SOS, C3G, Isc, Dbl, Dbl-related proteins, polypeptides 
comprising one or more DH domains, CDC24, Tiam-1, Ost, Lbc, Vav, Ect2, 
Bcr, Abr, Rho (A, B, and C), Rac, Ras, CDC42, chimeras tiiereof, biologically- 

30 active fragments thereof, muteins thereof, etc. 

24 
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The present invention thus also relates to the treatment and prevention 
of diseases and pathological conditions associated with Rac-mediated signal 
transduction, e.g., cancer, diseases associated with abnormal cell proliferation. 
For example, the invention relates to a method of treating cancer comprising 
5 administering, to a subject in need of treatment, an amount of a compound 

effective to treat the disease, where the compound is a regulator of Rac-GEF 
gene or polypeptide expression. Treating the disease can mean, delaying its 
onset, delaying the progression of the disease, improving or delaying clinical 
and pathological signs of disease. Similarly, the method also relates to treating 
10 diseases associated with inflammation, and /or the chemotactic ability of 

neutrophils. A regulator compound, or mixture of compounds, can be 
synthetic, naturally-occurring, or a combination. A regulator compound can 
comprise amino acids^ nucleotides, hydrocarbons, lipids, polysaccharides, etc. 
A regulator compound is preferably a regulator of Rac-GEF, e.g., inhibiting or 
1.5 increasing its mRNA, protein expression, or processing, or its interaction with 

Rac, e.g., guanine nucleotide exchange. Additionally, cells can be 
supplemented with Rac-GEF, or derivatives thereof. To treat the disease, the 
compound, or mixture, can be formulated into pharmaceutical composition 
comprising a pharmaceutically acceptable carrier and other excipients as 
20 apparent to the skilled worker. See, e.g.. Remington's Pharmaceutical Sciences, 

Eighteenth Edition, Mack Publishing Company, 1990. Such composition can 
additionally contain effective amounts of other compounds, especially for 
treatment of cancer. 

The present invention also relates to antibodies which specifically 
25 recognize a Rac-GEF polypeptide. Antibodies, e.g., polyclonal, monoclonal, 

recombinant, chimeric, can be prepared according to any desired method. For 
example, for the production of monoclonal antibodies, a polypeptide 
according to Fig. 1 (SEQ ID NO: 2), can be administered to mice, goats, or 
rabbit subcutaneously and /or intraperitoneally, with or without adjuvant, in 
an amount effective to elicit an immune response. The antibodies can also be 
single chain or FAb. The antibodies can be IgG, subtypes, IgG2a, IgGl, etc, 

25 
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An antibody specific for Rac-GEF means that the antibody recognizes a 
defined sequence of amino acids within or including the Rac-GEF amino acid 
sequence of Fig. 1 (SEQ ID NO: 2). Thus, a specific antibody will bind with 
higher affinity to an amino acid sequence, i.e., an epitope, found in Fig. 1 (SEQ 
5 ID NO: 2) than to a different epitope(s), e.g., as detected and/or measured by 

an immunoblot assay. Thus, an antibody which is specific for an epitope of 
Rac-GEF is useful to detect the presence of the epitope in a sample, e.g., a 
sample of tissue containing Rac-GEF gene product, distinguishing it from 
samples in which the epitope is absent. Such antibodies are useful as 

10 described in Santa Cruz Biotechnology, Inc., Research Product Catalog, can be 

formulated accordingly, e.g., 100 pg/ml. 

In addition, ligands which bind to a Rac-GEF polypeptide according to 
the present invention, or a derivative thereof, can also be prepared, e.g., using 
synthetic peptide libraries, or nucleic acid ligands (e.g., Pitrung et al., U.S. Pat. 

15 No. 5,143,854; Geysen et aL, 1987, J. Immunol. Methods, 102:259-274; Scott et 

al., 1990, Science, 249:386; Blackwell et al., 1990, Science, 250:1104; Tuerk et al., 
1990, Science, 249: 505. 

Antibodies and other ligands which bind Rac-GEF can be used in 
various ways, including as therapeutic, diagnostic, and commercial research 

20 tools,. e.g, to quantitate the levels of Rac-GEF polypeptide in animals, tissues, 

cells, etc., to identify the cellular localization and/or distribution of Rac-GEF, 
to purify Rac-GEF or a polypeptide comprising a part of Rac-GEF, to 
modulate the function of Rac-GEF, etc. Antibodies to Rac-GEF, or a 
derivative thereof, can be used in Western blots, ELISA, immunoprecipitation, 

25 RIA, etc. The present invention relates to such assays, compositions and kits 

for performing them, etc. 

An antibody according to the present invention can be used to detect 
Rac-GEF polypeptide or fragments thereof in various samples, including 
tissue, cells, body fluid, blood, urine, cerebrospinal fluid. A method of the 

30 present invention comprises contacting a ligand which binds to a peptide of 

Fig 1, (SEQ ID NO: 2) under conditions effective, as known in the art, to 
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achieve binding, detecting specific binding between the iigand and peptide. 
By specific binding, it is meant that the ligand attaches to a defined sequence 
of amino acids, e.g., within or including the amino acid sequence of Fig 1. 
(SEQ ID NO: 2) or derivatives thereof. The antibodies or derivatives thereof 
5 can also be used to inhibit expression of Rac-GEF or a fragment thereof. The 

levels of Rac-GEF polypeptide can be determined alone or in combination 
with other gene products. In particular, the amount (e.g., its expression level) 
of Rac-GEF polypeptide can be compared (e.g., as a ratio) to the amounts of 
other polypeptides in the same or different sample, e.g., p21, p53, Rb, WTl, 
10 etc. 

A ligand for Rac-GEF can be used in combination with other 
antibodies, e.g., antibodies that recognize oncological markers of cancer, 
including, Rb, p53, c-erbB-2, oncogene products, etc. In general, reagents 
which are specific for Rac-GEF can be used in diagnostic and/or forensic 

15 skidies according to any desired method, e.g., as U.S. Pat. Nos. 5,397,712; 

5,434,050; 5,429,947. 

The present invention also relates to a labelled Rac-GEF polypeptide, 
prepared according to a desired method, e.g., as disclosed in U.S. Pat. No. 
5,434,050. A labelled polypeptide can be used, e.g., in binding assays, such as 

20 to identify substances that bind or attach to Rac-GEF, to track the movement 

of Rac-GEF in a cell, in an in vitro, in vivo, or in situ system, etc. 

A nucleic acid, polypeptide, antibody, Rac-GEF ligand etc., according 
to the present invention can be isolated. The term "isolated" means that the 
material is in a form in which it is not found in its original environment, e.g., 

25 more concentrated, more purified, separated from component, etc. An 

isolated nucleic acid includes, e.g., a nucleic acid having the sequence of Rac- 
GEF separated from the chromosomal DNA found in a living animal. This 
nucleic acid can be part of a vector or inserted into a chromosome (by specific 
gene-targeting or by random integration at a position other than its normal 

30 position) and still be isolated in that it is not in a form which it is found in its 

natural environment. A nucleic acid or polypeptide of the present invention 
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can also be substantially purified. By substantially purified, it is meant that 
nucleic acid or polypeptide is separated and is essentially free from other 
nucleic adds or polypeptides, i.e., the nucleic acid or polypeptide is the 
primary and active constituent. 
5 The present invention also relates to a transgenic arumal, e.g., a non- 

human-mammai, such as a mouse, comprising a Rac-GEF nucleic acid. 
Transgenic animals can be prepared according to known methods, including, 
e.g., by pronuclear injection of recombinant genes into pronuclei of 1-celI 
embryos, incorporating an artificial yeast chromosome into embryonic stem 

10 cells, gene targeting methods, embryonic stem cell methodology. See, e.g., 

U.S. Patent Nos. 4,736,866; 4,873,191; 4,873,316; 5,082,779; 5,304,489; 5,174,986; 
5,175,384; 5,175,385; 5,221,778; Gordon et al., Proc. Natl. Acad. Set, 77:7380-7384 
(1980); Palmiter et al.. Cell 41:343-345 (1985); Palmiter et al., Ann. Rev. Genet., 
20:465-499 (1986); Askew et al., Mol. Cell. Bio., 13:4115-4124, 1993; Games et al. 

15 Nature, 373:523-527, 1995; Valancius and Smithies, MoL Cell. Bio., 11:1402-1408, 

1991; Stacey et al., Mol Cell. Bio., 14:1009-1016, 1994; Hasty et al.. Nature, 
350:243-246, 1995; Rubinstein et al., NucL Acid Res., 21:2613-2617,1993. A 
nucleic acid according to the present invention can be introduced into any 
non-human mammal, including a mouse (Hogan et al., 1986, in Manipulating 

20 the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 

Spring Harbor, New York), pig (Hammer et al., Nature, 315:343-345, 1985), 
sheep (Hammer et al.. Nature, 315:343-345, 1985), cattle, rat, or primate. See 
also, e.g.. Church, 1987, Trends in Biotech. 5:13-19; Clark et al., 1987, Trends in 
Biotech. 5:20-24; and DePamphilis et al., 1988, BioTechniques, 6:662-680. In 

25 addition, e.g., custom transgenic rat and mouse production is commercially 

available. These transgenic animals are useful as a cancer model, e.g., to test 
drugs. 

Generally, the nucleic acids, polypeptides, antibodies, etc. of the present 
invention can be prepared and used as described in U.S. Pat. Nos. 5,501,969; 
30 5,506,133; 5,441,870; WO 90/00607; and WO 91/15582. 
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For other aspects of the nucleic acids, polypeptides, antibodies, etc., 
reference is naade to standard textbooks of molecular biology, protein science, 
and immunology. See, e.g., Davis et al. (1986), Basic Methods in Molecular 
Biology, Elsevir Sciences Publishing, Inc., New York; Hames et al. (1985), 
5 Nucleic Acid Hybridization, IL Press, Molecular Cloning. Sambrook et al.: Current 

Protocols in Molecular Biology, Edited by P.M. Ausubel et al.. John Wiley & Sons. 
Inc; Current Protocols in Human Genetics. Edited by Nicholas C. Dracopoli et al., 
John Wiley & Sons. Inc.: Current Protocols in Protein Science: Edited by John E. 
Coligan et al., John Wiley & Sons. Inc.: Current Protocols in Immunology: Edited by 
10 John E. Coligan et al., John Wiley & Sons, Inc. 

EXAMPLES 
Example 1 
Cloning of cDNA encoding Rac GEF 
A Dbl-homology domain containing protein was identified in a human fetal 
15 brain cDNA library as follows. A TBLASTN search of the dbEST database 

was performed using the amino acid sequence (residues 1-519) encoded by the 
human TIM protein (Chan et al, 1994, Oncogene, Vol. 9, pages 1057-1063). 
One EST clone, # 167059 was identified with high sequence homology to the 
TIM cDNA. The piasmid encoding this insert was purchased via the 
20 I.M.A.G.E, Consortium (Research Genetics). Using this cDNA as template, a 

511-bp "P-labelled PGR product was produced using oligos 
5'-GGAGGCCATGTTCGAGCTGG-3' and 

5'- GCTGATCATCTGTTCCGTGC-3' (5' and 3' primers, respectively) and ^^P 
labelled nucleotides. This labeled PGR fragment was used as a probe to screen 

25 approximately 4 x 10' clones of a human fetal brain Lambda ZAP cDNA 

library (Stratagene). A clone with an insert of 2.6-kb was isolated, and the 
complete DNA sequence of this clone was determined using an ABI sequencer. 
This 2.6-kb clone harbored a single open reading frame of 1950-bp that is 
predicted to encode a 650-amino acids protein with a calculated molecular 

30 mass of 74.7 kDa. However, this open reading frame is not full-length, as the 

initiating methionine is missing. This cDNA is on deposit with the American 
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Type Culture Collection, December 11, 1996, with the Accession No.98273, 
and is denoted p67 Rac-GEF. 

Northern analysis using the probe described above was conducted. The 
results revealed a 3.5 kb transcript specific to brain tissue and an additional 4 
kb tiranscript of lower abundance specific to liver tissue. Other normal tissues 
tested, including heart, placenta, lung, muscle, kidney, pancreas, spleen, 
thymus, prostate, testis, ovary, intestine, colon and peripheral blood 
lymphocytes were also essentially negative. In a preliminary screen of human 
tumor cell lines, abundant 3.5 kb mRNA levels were detected in the lung 
carcinoma cell line A549 and the colon carcinoma cell line SW480. Other 
tijmor cell lines were negative, including HL-60, HeLa, K-562, Molt-4, Raji and 
G-36. Further screerung of a number of primary tumor samples revealed over- 
expression in liver, lung and colon tiimors. 

Using the additional sequence identified in the 2.6-kb clone, further 
analysis of the dbEST database using the Blashi program identiiied an 
additional clone, # 109922, which had been isolated from a liver library. The 
plasmid encoding this insert was purchased via the I.M.A.G.E. Consortium 
(Genome Systems), and the sequence of the insert was determined. This 
sequence revealed an initiating methionine and 126 additional amino acids 
which differed from the amino-terminal 66 amino acids of the 2.6 kb brain 
clone described above. This new sequence most likely encodes the liver- 
specific alternatively spliced form which had been identified by Northern 
analysis. Pieced together with the previously determined sequence, this liver- 
derived sequence reveals an open reading frame of 2133-bp predicted to 
encode a 710-amino acid protein. 

In addition to the alternatively-spliced brain/liver isoforms, another 
putative splice variant was identified: an insertion of 72-bp coding for 24- 
amino acids within the Dbl homology region is encoded by the 2.6-kb brain 
clone. The sequence encoded by these 24-amino acids is conserved among 
other exchange factors including Tim (Chan et al., 1994, Oncogene, Vol. 9, 
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pages 1057-1063) and Vav2 (Henske et ah, 1995, Ann Hum Genet 59, Pt. 1, 
pages 25-37). 

Example 2 
Properties of Rac GEF 

5 Two Rac-GEFs were tested for guanine nucleotide activity. 

Firstly, a Glu-epitope tag (MEYMPMEIRHD) was engineered onto the 
carboxy-terminal 423 amino acids of the Rac GEF encoded by EST No. 167059 
by introducing the oligos 

5'-TCGAGGAGGTTATAAATATGGAATACATGCCAATGGA-3' and the 
1 0 complementary 

5'-AATTTCCATTGGCATGTATTCCATATTTATAACCTCC-3' into the 
XhoI/EcoRI sites of the clone. The protein encoded by this construct is 
referred to as Type I Rac-GEF. 

Next, the sequence encoding the insertion in the Dbl homology region, 

15 as described in Example 1, was en^dneered into the open reading frame in the 

expression plasmid pET21a (Novagen). The protein encoded by this construct 
is referred to as Type II Rac-GEF. The resulting expression plasmids were 
introduced into E. coli strain BL21(DE3)pLysS (Novagen), and the epitope 
tagged protein expression was induced with IPTG. The expressed proteins 

20 were purified using a resin with the antibody to the Glu-epitope covalently 

attached. The resulting proteins were partially pure and were assayed for 
exchange activity on Racl, RhoA and Cdc42. See, Hart, in U.S. Serial No. 
60/029,979, filed November 6, 1996. The results showed that Rac GEF is 
primarily selective for Racl, but also displays activity against both RhoA and 

25 Cdc42. Furthermore, the Type I form lacking the Dbl insert region is 

unaffected by the addition of the PH domain ligand ascorbyl stearate, while 
the Type II form contairung the Dbl insert region is strongly stimulated by 
ascorbvl stearate. 
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Example 3 
Immunochemical detection 
Antibody specific to Rac-GEF was raised in rabbits against three 
fragments of the purified recombinant molecule. The fragments correspond to 
5 amino acids 385-398, Type II and amino acids, 372-386 of Type I Rac-GEF 

referred to in Example 2, and 693-710 amino acids of Type 11. The peptides 
were coupled to KLH, and antibody raised in rabbits using standard 
procedures. 

Example 4 

10 Cloning and Expression of Tiam-1 and Truncations Thereof 

Cloning and expression of Tiam-1, and various Tiam-1 truncations, is 
described below. This work, and that shown in Examples 5 and 6, was 
undertaken to determine those regions of Tiam-1 that realize GEF enhancer 
stimulation of Rac GEF activity. 

15 cDNA Cloning Of Human Tiam- 1 and Tiam-1 Truncations : Primers designed 

against published mouse Tiam-1 cDNA sequence (See, Habets, G.G, Scholtes, 
E.H., Zuydgeest, D., van der Kammen, R.A., Stam, J.C., Berns, A., and Collard, 
J.G. (1994) Cell 77, 537-549; NCBI Gen Bank Accession #U05245) were used in 
PGR reactions using a human fetal brain library (Stratagene #936206) as 

20 template to obtain fragments of the human Tiam-1 gene which were 

radiolabled and used as probes in Southern hybridizations of the same library. 
Primer pairs used were both 
5'-CCATAAAACCATGGGAAACGC-3' and 
5'-GGTTCCGCGGAAGAGAAGGAT-3' with 

25 5'-GACTGGCCCGGGGAACTGAGG-3'; and 

5'-TCGGATGCGGATAAGCTGCGC-3' with 

5'-GTGACTGGCGACCTTGTTCAT-3'. Two partial clones of human Tiam-1 
cDNA were retrieved, one contained nucleotides (nt) 1-2972 and the other (nt) 
2972-4657 (numbering throughout corresponds to previously published Tiam- 
30 1 cDNA (See, Habets, G.G., van der Kammen, R.A., Stam, J.C., Michiels, F., 

and Collard, J. G. (1995) Oncogene 10, 1371-1376; NCBI Gen Bank Accession 
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#U16296). To obtain missing C-terminal sequences, a PGR reaction employing 
oligonucleotides designed against the human Tiam-1 cDNA (See, Habets, 
G.G., van der Kammen, R.A., Stam, J.C., Michiels, F., and Collard, J. G. (1995) 
Oncogene 10, 1371-1376; NCBI Gen Bank Accession #U16296) 5'- 
5 CGGAATTCAGATTTCGACACATGATC-3' (sense) and 

5'-TCGCCCGGGGCAGGTGACGCAGTCAGA-3' (antisense, contains Smal 
site downstream of stop codon) as primers and a human hippocampal library 
(Clontech #HL3023b) as template produced a fragment containing nt 4458-5366 
which was added to existing clones using the internal Eco47III (4487) site. A 

10 similar strategy using the antisense primer 

5'-GATCCCGGGTCATGTTTCTGGTTCTGGGATCTCAGTGTTCAGTTTCCTG-3' 
was used to add the KT3 epitope tag "PEPET," a stop codon, and a Smal site to 
the end of Tiam-1. To splice the two partial clones together, a PGR reaction 
using 5'-CGGAATTCCATGGGCCGCCTTGGAATCT-3' (sense) and 5'- 

15 TCGCCCGGGCGTCAGCAGCACGATTAT-3' 

(antisense) as primers, and a human fetal brain cDNA library (Clontech 
#HL50156) as template produced a product spanning nt 2422-3189 which was 
cloned into pBS SK+ (Stratagaene #212201) using EcoRI and Smal. Ncol (472) - 
Ncol (2422) and StuI (3134) -Smal fragments were ligated into this vector, 

20 creating full-length clones, with and without the KT3 tag. 

It is note worthy that the isolated Tiam-1 sequence was altered from the 
published sequence. The 5' clone obtained from the library contained an 
insert of sequence 5'- 

GGTGAGCAGTTTACACTTTCATATACTCCCTGTCATGTGCTTTGAAGGACTTTC 
25 TAGGGGCATCAAG-3' in the upstream non-coding region at nt 105. Original 

clones from the Stratagene fetal brain library as well as all PGR products from 
Clontech hippocampal and brain libraries contained a difference in sequence 
from the published Tiam-1 cDNA ((See, Habets, G.G., van der Kammen, R.A., 
Stam, J.C., Michiels, F., and Collard, J. G. (1995) Oncogene 10, 1371-1376; NCBI 
30 Gen Bank Accession #U16296); a G at nt 3005 instead of a C, which therefore 
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encodes a Gin in position 844 instead of a His. In addition, PGR introduced 
silent mutations G4739A and G5153A. 

Example 5 
Expression n f Tiam-1 and Truncatinns 
The following expression vectors were coristructed and used to express 
the appropriate Tiam-1 constructs. 

Full-length (178 kP) Tiam-1: A KT3-tagged 4792 basepair (bp) Ncol (472)- 
Smal fragment was ligated into NcoI-Smal-digested pAcG4 (See, Rubinfeld, 
B., et al. Gell 65, 1033-1042 (1991)): Bio /Technology 6:47-55 (derived from 
pAc436)). 

135 kP Tiam-1: The 5' phosphorylated oligonucleotides 
5'-GTCATGATGG-3' and 5'-TCCATCATGAGGGCC-3' were used as linkers to 
recircularize Apal - EcoNI (1673)-digested pBS SK+-baseci full-length Tiam-1. 
The linker-created BspHI site and the vector-derived Spel site were used to 
clone the 4006 bp fragment into Ncol-X bal-digested pAcC4 (See, Rubinfeld, 
B., et al. Cell 65, 1033-1042 (1991)). 

106 kP Tiam-1: The Ncol (472) - Ncol (2422) fragment was removed from the 
full-length pAcC4-based expression vector. 

85 kP Tiam-1: PGR using 

5'-CTTGAArrCCACCATGGAAATCTGTCCAAAAGTCACT-3' (sense) and 
5'-TCGCGCGGGCGTCAGCAGCAGGATTAT-3' (antisense) as primers and 
the Stratagene Tiam-1 nt 2972-4657 clone as template was used to create an 
NcoI-StuI (3134) fragment that placed an ATG before nt 2972. The 2297 bp 
Ncol-Smal was ligated into NcoI-Smal-digested pAcC4 (See, Rubinfeld, B., et 
al. Cell 65, 1033-1042 (1991)). 
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66 kP Tiam-1: The 5' phosphorylated oligonucleotides 

5'-CATGGACCAGAACCCATCTCC-3' and 5'-TGAGGAGATGGGTTCTGGTC-3' 
were used as linkers to recircularize Ncol (472) - Bsu36I (3534)-digested pBS 
SK+'based full-length Tiam-1. The linker-regenerated Ncol site and the 
5 vector-derived Spel site were used to clone the 1761 bp fragment into Ncol- 

Xbal-digested pAcC4 (See, Rubinfeld, B., et al. Cell 65, 1033-1042 (1991)). 

APH versi ons of Tiam-1 : The oligonucleotides 

5'-GCCAGAACCAGAAACATGAC-3' and 

5'-CCGGGTCATGTTTCTGGTTCTGGC-3' were used as linkers to 
1 0 recircularize Eco47III (4487) and Xmal-digested pAcC4-based expression 

vectors containing the 135 kD, 106 kD, 85 kD, and 66 kD versions of Tiam-1. 
These primers also restored the KT3 tag. 

GST-PH domain fusion proteins: Products from PGR reactions using Tiam-1 
cDNA as template and 

15 5'-GAGGAATTCGATCTGAGCATGGGAGACCTG-3' and 

5'-CTGCTCGAGCTACTTATCACGCAGGATTGAATG-3' (C-terminal PH 
domain) or 5'-CAGGAATTCGTGCGCAAGGCCGGCGCCCTG-3' and 
5'-GTGCTCGAGCTACGCAGTGGCGCAGGCAGAGTG-3' (N-terminal PH 
domain) as primers were cloned into pGEX20T (See, Helin, K., Harlow, E. and 

20 Fattaey, A. (1993) Mol. Cell. Biol. 13, 6501-6508) using EcoRI and Xhol. 

All Tiam-1 constructs, except for the GST fusions, were produced in 
baculoviurs-infected S. frugiperda -9 cells and were purified using KT3-mAb 
immunoaffinity chromatography. See, Schreurs, J., Yamamoto, R., Lyons, J., 
Munemitsu, S., Conroy, L., Clark, R., Takeda, Y., Krause, J.E., and Innis, M. 
15 (1995) /. Neurochem. 64, 1622-1631. GST fusion proteins were produced in 

E.coli and purified using glutathione-agarose. See, Smith, D.B. and Johnson, 
K.S, (1988) Gene 67, 31-40. 
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Example 6 

Stimulators of Tiam-1 Rac GEF Activity 

The GEF activity of the various Tiam-1 constructs described above was 
determined in the presence and absence of certain compounds. The following 
5 assay was utilized. Reactions were conducted at room temperature in Buffer 

A (20mM Hepes pH7.3, 50mM NaCl, 2mM DTT, 2mM MgC12). All proteins 
and compounds were diluted to 4x their final concentrations in Buffer A 
(GTPases were diluted in Buffer A containing lyM GDP). To dilute Ascorbyl 
Stearate, Ascorbyl Palmitate, and Stearic Acid, 25mM EtOH solutions were 

10 slowly added to Buffer A while vortexing vigorously. Other lipids were 

resuspended in aqueous solution with vortexing and bath sonication and then 
diluted into Buffer A. Reactions were prepared and at time 0, X-^^P-GTP 
(DuPont NEN #NEG006H) was added to 4.5nM, and after 10 minutes 
reactions were stopped by filtering onto nitrocellulose filters (Millipore 

15 #HAWP02500) and immediately washed with wash buffer (25mM Tris 7.5, 

lOOmM NaCl, 30mM MgC12). Bound X-''P-GTP was measured, using 
standard techniques. 

The 85 kD portion of human Tiam-1 protein was produced in insect 
cells and purified by affinity chromatography, as described above. This 

20 protein contained an intact PDZ domain, Dbl-homology (DH) domain and 

adjacent pieckstrin homology (PH) domain (Fig. 3). Using the above 
described assay, this truncation alone, at various concentrations, exhibited no 
GEF activity towards Rac 1 (Fig. 4). In contrast, ascorbyl stearate (AS) 
stimulated the rate of Tiam-l-mediated nucleotide exchange on Rac 1 (Fig. 4). 

25 Because AS has the potential to act as a detergent or a reducing agent, other 

detergents (nOG, Trition X-100, NP40) and reducing agents (DTT, TCEP, or 
Tris (2-carboxyethyl)phosphine) were tested and shown not to significantiy 
stimulate Tiam-1 GEF activity. Several other lipids were tested to determine 
the specificity of activation. Ascorbyl palmitate (AP), phosphatidylinositol-4- 
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phosphate (PI(4)P), and phosphatidylinositol-4,5-bisphosphate (PI(4,5)P2) 
significantly enhanced Tiam-1 activity; phosphatidylinositol-3,4,5- 
trisphosphate (PI(3,4,5)P3) and phosphatidylserine had weak effects; and 
phosphatidylglycerol, phosphatidylinositoi and phosphatidylcholine had little 
5 or no effect (Fig. 5). As a control, experiments were run to determine if IP3, 

IP2, ascorbic acid, stearic acid and ascorbic acid with stearic acid were 
sufficient to activate the GEF activity of Tiam-1. The results showed that these 
reagents were incapable of stimulating GEF activity (Fig. 5). 

It has been previously reported that Tiam-1 has GEF activity in the 

10 absence absence of lipids. See, Michiels, F., Habets, G.G., Stam, J.C., van der 

Kammen, R.A., and Collard, J.G. (1995) Nature 375, 338-340. Those studies 
used a mouse version containing additional upstream sequences, including the 
N-terminal PH domain and the coiled-coil region. To determine if upstream 
regions are necessary for expression of the DH domain GEF activity, the 

15 corresponding human construct was prepared (Fig. 3). This 135 kD Tiam-1 

truncation shows weak GEF activity towards Rac in the absence of AS, but is 
still greatly stimulated by AS (Fig. 4). Other truncations of Tiam-1, all 
containing the DH and PH domains (Fig. 3), also exhibited AS stimulated GEF 
activity on Rac 1 (Fig. 6). 

20 To determine if AS binding to the PH domain was responsible for 

activation of Tiam-1 GEF activity, sequences 3' of the Eco47III site were 
deleted, removing half of the C-terminal PH domain as well as the rest of the 
C-terminus (Fig. 3). These truncations of Tiam-1 were not activated by AS, 
including one that contained the N-terminal PH domain (Fig. 4). While it is 

25 possible that deleting the PH domain destroys activity of the DH domain 

altogether, similar truncations of the PH domain of the Dbl protein do not 
affect its GEF activity (See, Zheng, Y., Zangrilli, D., Cerione, R.A., and Eva, A. 
(1996). /. Biol. Chetn. 271, 19017-19020). To further determine if the PH 
domains could bind to AS, GST-PH fusion proteins were included in the 

30 reaction. While GST alone did not affect AS-stimulated Tiam-1 exchange 
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activity, both of the Tiam-1 GST-PH domain fusions reduced the effectiveness 
of AS (Fig. 6). 

Without further elaboration, it is believed that one skilled in the art can, 
using the preceding description, utilize the present invention to its fullest 
5 extent. The preceding preferred specific embodiments are, therefore, to be 

construed as merely illustrative, and not limitative of the remainder of the 
disclosure in any way whatsoever. 

The entire disclosure of all patents/patent applications and 
publications, cited above and in the figures are hereby incorporated by 
10 reference in the entirety. 

From the foregoing description, one skilled in the art can easily 
ascertain the essential characteristics of this invention, and without departing 
from the spirit and scope thereof, can make various changes and modifications 
of the invention to adapt it to various usages and conditions. 
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SEQUENCE LISTING 



15 



45 



60 



65 



70 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Bollag, Gideon 
Crompcon, Anne 
North, Anne 
Roscoe, William 
10 Sharma. Sanju 



(ii) TITLE OF INVENTION: Methods and Compositions for Treating 

Abnormal Cell Growth Related to Unwanted Guanine Exchange 
Factor Activity 

•(iii) NUMBER OF SEQUENCES; 2 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ONYX Pharmaceuticals, Inc. 

20 (B) STREET: 3031 Research Drive 

(C) CITY: Richmond 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94806 

25 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

iC) OPERATING SYSTEM: PC-DOS/MS-DOS 

30 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

{vi) CURRENT APPLICATION DATA: 

(A) APPLICAT ON NUMBER: US Unknown 

(B) FILING DATE: 17-JUN-1997 

35 (C) CLASSIFICATION: Provisional 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Giotta Ph.D., Gregory 

(B) REGISTRATION NUMBER: 32,028 

40 (C) REFERENCE/ DOCKET NUMBER: ONYX1028 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (510) 262-8710 

(B) TELEFAX: (510) 222-9758 



(2) INFORMATION FOR SEQ ID N0;1: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 3171 base pairs 

(B) TYPE: nucleic acid 
to STRANDEDNESS : single 
(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 76. .2208 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
TCACTCAAAC CAGTGAAGCT TGGGAAAGTC ATTGACCTCC AGTCGTTCTG CTGAGAAACA 60 
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TCTGGCTCTA TTTCC ATG GAG ACC AGG GAA TCT GAA GAT TTG GAA AAG ACC 111 
Met Glu Thr Arg Glu Ser Glu Asp Leu Glu Lvs Thr 
1 5 10 ' 

5 CGG AGG AAA TCA GCA AGT GAT CAA TGG AAC ACT GAT AAT GAA CCA GCC 159 

Arg Arg Lys Ser Ala Ser Asp Gin Trp Asn Thr Asp Asn Glu Pro Ala 
15 20 25 

in f,'^? ^ ^^"^ "^^^ ^'^'^ GAA AAA GAG GAG ACT TCT CAA GCT 207 

lU Lys Val Lys Pro Glu Leu Leu Pro Glu Lys Glu Glu Thr Ser Gin Ala 

30 35 40 

GAC CAG GAT ATC CAA GAC AAA GAG CCT CAT TGC CAC ATC CCA ATT AAG 255 
,r Asp Lys Glu Pro His Cys His He Pro He Lys 

*15 50 55 Iq 



20 



25 



30 



40 



50 



65 



70 



AGA AAT TCC ATC TTC AAT CGC TCC ATA AGA CGC AAA AGC AAA GCC AAG 
Arg rtsn Ser He Phe Asn Arg Ser He Arg Arg Lys Ser Lys Ala Lys 
65 70 75 



303 



351 



399 



GCC AGA GAC AAC CCC GAA CGG AAC GCC AGC TGC CTG GCA GAT TCA CAG 
Ala Arg Asp Asn Pro Glu Arg Asn Ala Ser Cys Leu Ala Asp Ser Gin 
SO 85 90 

GAC A.nT GGA AAA TCT GTA AAT GAG CCC CTG ACC TTG AAT ATC CCC TGG 
Asp „3n Gly Lys Ser Val Asn Glu Pro Leu Thr Leu Asn He Pro Trp 
95 100 105 

AGC AGA ATG CCT CCT TGC AGA ACA GCA ATG CAG ACA GAC CCA GGA GCC 447 
Ser Arg Met Pro Pro Cys Arg Thr Ala Met Gin Thr Asp Pro Gly Ala 
110 115 120 

rtn rt^ '^^^ '^^^ '^^^ AAT GGG GCC ACG CCC 495 

Gin Glu Met Ser Glu Ser Ser Ser Thr Pro Gly Asn Gly Ala Thr Pro 
130 135 3^40 

GAG GAG TGG CCG GCC CTG GCC GAC AGC CCC ACC ACG CTC ACC GAG GCC 54 3 

Glu Glu Trp Pro Ala Leu Ala Asp Ser Pro Thr Thr Leu Thr Glu Ala 
145 150 155 

CTG CGG ATG ATC CAC CCC ATT CCC GCC GAC TCC TGG AGA AAC CTC ATT 
Leu Arg Met He His Pro He Pro Ala Asp Ser Trp Arg Asn Leu He 
160 165 170 

45 



591 



fu, <5AT AAA TCG ACT CTC 639 

Glu Gin He Gly Leu Leu Tyr Gin Glu Tyr Arg Asp Lys Ser Thr Leu 
175 180 185 

tT^ '^^^ '^AT GCA GAA ATA GAA GAC AAT 687 

Gin Glu He Glu Thr Arg Arg Gin Gin Asp Ala Glu He Glu Asp Asn 
190 195 200 

TH^ ^'^ c*^^ n'^'^ '^^'^ ^''^ ''^^ <5AG GAG GAA GAA GAA 735 

Tnc ^ ^^"^ '^^^ A=P "^"^ P>^«> Glu Glu Glu Glu Glu 

205 210 215 220 

ctu rfn r^? ^^'^ "^^"^ ^"^^ GAG AGG AAG ACT CTG 783 

Glu Glu Glu Glu Glu Glu Pro Ala Ser Pro Pro Glu Arg Lys Thr Leu 

60 "0 235 

III r^n T?*^ '^^^ TCA AGG TTC AAC CTC TGG 831 

Pro Gxn He Cys Leu Leu Ser Asn Pro His Ser Arg Phe Asn Leu Trp 
240 245 250 

CAG GAT CTT CCC GAG ATC CGG AGC AGC GGG GTG CTT GAG ATC CTA CAG 
Gin «sp Leu Pro Glu He Arg Ser Ser Gly Val Leu Glu He Leu Gin 
255 260 265 

CCT GAG GAG ATT AAG CTG CAG GAG GCC ATG TTC GAG CTC GTC ACT TCC 927 

Pro Glu Glu He Lys Leu Gin Glu Ala Met Phe Glu Leu Val Thr Ser 
"'0 275 280 

40 



879 



GAG GCG TCC TAC TAC AAG AGT CTG AAC CTG CTC GTG TCC CAC TTC ATG 
Glu Ala Ser Tyr Tyr Lys Ser Leu Asn Leu Leu Val Ser His Phe Met 
285 290 295 300 

GAG AAC GAG CGG ATA AGG AAG ATC CTG CAC CCG TCC GAG GCG CAC ATC 
Glu Asn Glu Arg He Arg Lys He Leu His Pro Ser Glu Ala His He 
305 310 . 315 

CTC TTC TCC AAC GTC CTG GAC GTG CTG GCT GTC AGT GAG CGG TTC CTC 
Leu Phe Ser Asn Val Leu Asp Val Leu Ala Val Ser Glu Arg Phe Leu 
320 325 330 

CTG GAG CTG GAG CAC CGG ATG GAG GAG AAC ATC GTC ATC TCT GAC GTG 
Leu Glu Lgu Glu His Arg Met Glu Glu Asn He Val He Ser Asp Val 
335 340 345 

TGT GAC ATC GTG TAC CGT TAT GCG GCC GAC CAC TTC TCT GTC TAC ATC 
Cys Asp He Val Tyr Arg Tyr Ala Ala Asp His Phe Ser Val Tyr He 
350 355 360 

ACC TAC GTC AGC AAT CAG ACC TAC CAG GAG CGG ACC TAT AAG CAG CTG 
Thr Tyr Val Ser Asn Gin Thr Tyr Gin Glu Arg Thr Tyr Lys Gin Leu 
365 370 375 390 

CTC CAG GAG AAG GCA GCT TTC CGG GAG CTG ATC GCG CAG CTA GAG CTC 
Leu Gin Glu Lys Ala Ala Phe Arg Glu Leu He Ala Gin Leu Glu Leu 
385 390 395 

GAC CCC AAG TGC AGG GGG CTG CCC TTC TCC TCC TTC CTC ATC CTG CCT 
Asp Pro Lys Cys Arg Gly Leu Pro Phe Ser Ser Phe Leu He Leu Pro 
400 405 410 

TTC CAG AGG ATC ACA CGC CTC AAG CTG TTG GTC CAG AAC ATC CTG AAG 
Phe Gin Arg He Thr Arg Leu Lys Leu Leu Val Gin Asn He Leu Lys 
. 415 420 425 

AGG GTA GAA GAG AGG TCT GAG CGG GAG TGC ACT GCT TTG GAT GCT CAC 
Arg Val Glu Glu Arg Ser Glu Arg Glu Cys Thr Ala Leu Asp Ala His 
430 435 440 

AAG GAG CTG GAA ATG GTG GTG AAG GCA TGC AAC GAG GGC GTC AGG AAA 
Lys Glu Leu Glu Met Val Val Lys Ala Cys Asn Glu Gly Val Arg Lys 
445 450 455 460 

ATG AGC CGC ACG GAA CAG ATG ATC AGC ATT CAG AAG AAG ATG GAG TTC 
Met Ser Arg Thr Glu Gin Met He Ser He Gin Lys Lys Met Glu Phe 
465 470 475 

AAG ATC AAG TCG GTG CCC ATC ATC TCC CAC TCC CGC TGG CTG CTG AAG 
Lys He Lys Ser Val Pro He He Ser His Ser Arg Trp Leu Leu Lys 
480 485 490 

CAG GGT GAG CTG CAG CAG ATG TCA GGC CCC AAG ACC TCC CGG ACC CTG 
Gin Gly Glu Leu Gin Gin Met Ser Gly Pro Lys Thr Ser Arg Thr Leu 
495 500 505 

AGG ACC AAG AAG CTC TTC CAC GAA ATT TAC CTC TTC CTG TTC AAC GAC 
Arg Thr Lys Lys Leu Phe His Glu He Tyr Leu Phe Leu Phe Asn Asp 
510 515 520 

CTG CTG GTG ATC TGC CGG CAG ATT CCA GGA GAC AAG TAC CAG GTA TTT 
Leu Leu Val He Cys Arg Gin He Pro Gly Asp Lys T-/r Gin Val Phe 
525 530 535 ■ 540 

GAC TCA GCT CCG CGG GGA CTG CTG CGT GTG GAG GAG CTG GAG GAC CAG 
Asp Ser Ala Pro Arg Gly Leu Leu Arg Val Glu Glu Leu Glu Asp Gin 
545 550 555 

GGC CAG ACG CTG GCC AAC GTG TTC ATC CTG CGG CTG CTG GAG AAC GCA 
Gly Gin Thr Leu Ala Asn Val Phe He Leu Arg Leu Leu Glu Asn Ala 
560 565 570 
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GAT GAC CGG GAG GCC ACC TAC ATG CTA AAG GCG TCC TCT CAG AGT GAG 1839 
Asp Asp Arg Glu Ala Thr Tyr Met Leu Lys Ala Ser Ser Gin Ser Glu 
575 580 585 

ATG AAG CGT TGG ATG ACC TCA CTG GCC CCC AAC AGG AGG ACC AAG TTT 1887 
Met Lys Arg Trp Met Thr Ser Leu Ala Pro Asn Arg Arg Thr Lys Phe 
590 595 600 

GTT TCG TTC AC A TCC CGG CTG CTG GAC TGC CCC CAG GTC CAG TGC GTG 193 5 

Val Ser Phe Thr Ser Arg Leu Leu Asp Cys Pro Gin Val Gin Cvs Val 
605 610 615 ■ 620 

CAC CCA TAC GTG GCT CAG CAG CCA GAC GAG CTG ACG CTG GAG CTC GCC 1983 
His Pro Tyr Val Ala Gin Gin Pro Asp Glu Leu Thr Leu Glu Leu Ala 
625 630 635 

GAC ATC CTC AAC ATC CTG GAC AAG ACT GAC GAC GGG TGG ATC TTT GGC 2031 
Asp lie Leu Asn lie Leu Asp Lys Thr Asp Asp Gly Trp He Phe Gly 
640 645 650 

GAG CGT CTG CAC GAC CAG GAG AGA GGC TGG TTC CCC AGC TCC ATG ACT 207 9 

Glu Arg Leu His Asp Gin Glu Arg Gly Trp Phe Pro Ser Ser Met Thr 
655 660 665 

GAG GAG ATC TTG AAT CCC AAG ATC CGG TCC CAG AAC CTC AAG GAA TGT 2127 
Glu Glu He Leu Asn Pro Lys He Arg Ser Gin Asn Leu Lys Glu Cvs 
670 675 680 

TTC CGT GTC CAC AAG ATG GAT GAC CCT CAG CGC AGC CAG AAC AAG GAC 217 5 

Phe Arg Val His Lys Met Asp Asp Pro Gin Arg Ser Gin Asn Lvs Asp 
685 690 695 * 700 

CGC AGG AAG CTG GGC AGC CGG AAT CGG CAA TGA CCCCCACCCA GGGGGCCAGC 2228 
Arg Arg Lys Leu Gly Ser Arg Asn Arg Gin * 
705 710 

GGGAGCAGGG CCTGCATGAG ACCCCGACAG AAGGTGGGGG GGGGGGGGGG GGCtCTGGGA 228 8 

AGCACAGGCC AGCACCTCCC CAGGTGGCAG GATCTGGCTT GGGGTGCCCG GCCCTCATCC 234 8 

CTGCCCACGC AGTGAGTGCT CATGTGTCTT GGCCCCTTGC TCGCAAACTG GATAAAGGGT 2408 

GCCCAAGCCT CTCCTGATGC ATTTGTAAAC AAGAAGGTTT CAGCAGTATT ACACCACCTC 2468 

CCTCATGCCT CCGAGGGGGT GGAAGGGGGT GGGCACACTC CAGGGCCCCC CATGCCCCTG 2523 

GCCCCCAGGG ATTGGAAGAG GCTCCCAACC CAGAGTGTCC CTGTGGGAGG CAGGCAGAAG 2588 

GTGACAATTG ACACGATTTC CTGCACGCGT CTTCTTTTAC CTTGGAAGCA GTTAGAATTT 264 8 

ACCAGGCACA GATGAGGCCG CCCTTGCCTG ACGGAGCTTG ATGAGCAGCC CTTGGTCTCC 2708 

GGTTCCAGGA CTGAGAGCCC AGCTGCCTCT GCCCACCCTT CCCCAGGCCT CTGCCAGCCT 2768 

CTGGCTGCAC GGTCAGGCCC TGCCCCATGG CAGGCCTGCC AGAGCTTGGC TGGGGACCCC 282 8 

TCCCGCCTCT GGCTCCCTGA TGGGCTGGAT GTAACTTGTG TCTTCTAGCC CCTTAAGGAG 2888 

CCCAGGTGTT TTAAGGAATG AATTGGTCAC TGCATCTTGT ATCGATTATG GTTCTGAGAA 2948 

AAGCAAATAT CGGAATTCCT GCAGCCCGGG AAATGGGGCC ACGCCCGAGG AGTGGCCGGC 3008 

CCTGGCCGAC AGCCCCACCA CGCTCACCGA GGCCCTGCGG ATGATCCACC CCATTCCCGC 3068 

CGACTCCTGG AGAAACCTCA TTGAACAAAT AGGGCTCCTG TATCAGGAAT ACCGAGATAA 3128 

ATCGACTCTC CAAAAAAAAA AAAAAAAAAA GATCTTTAAT TAA 3171 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

42 
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■ (A) LENGTH: 711 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

5 (ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Thr Arg Glu Ser Glu Asp Leu Glu Lys Thr Arg Arg Lvs Ser 
10 1 5 10 15 

Ala Ser Asp Gin Trp Asn Thr Asp Asn Glu Pro Ala Lys Val Lvs Pro 
20 25 30 

15 Glu Leu Leu Pro Glu Lys Glu Glu Thr Ser Gin Ala Asp Gin Asp lie 

35 40 45 



20 



35 



50 



65 



Gin Asp Lys Glu Pro His Cys His lie Pro He Lys Arg Asn Ser He 
50 55 60 

Phe Asn Arg Ser He Arg Arg Lys Ser Lys Ala Lys Ala Arg Asp Asn 
65 70 75 80 



Glu Arg Asn Ala Ser Cys Leu Ala Asp Ser Gin Asp Asn Glv Lys 
25 85 90 95 

Ser Val Asn Glu Pro Leu Thr Leu Asn He Pro Trp Ser Arq Met Pro 
100 105 110 

30 Pro Cys Arg Thr Ala Met Gin Thr Asp Pro Gly Ala Gin Glu Met Ser 

115 120 125 



Glu Ser Ser Ser Thr Pro Gly Asn Gly Ala Thr Pro Glu Glu Trp Pro 
130 135 140 

Ala Leu Ala Asp Ser Pro Thr Thr Leu Thr Glu Ala Leu Arg Met He 
145 150 155 160 



His Pro He Pro Ala Asp Ser Trp Arg Asn Leu He Glu- Gin He Gly 

40 165 170 175 

Leu Leu Tyr Gin Glu Tyr Arg Asp Lys Ser Thr Leu Gin Glu He Glu 
180 185 190 

45 Thr Arg Arg Gin Gin Asp Ala Glu He Glu Asp Asn Thr Asn Gly Ser 

195 200 205 



Pro Ala Ser Glu Asp Thr Pro Glu Glu Glu Glu Glu Glu Glu Glu Glu 
210 215 220 

Glu Glu Pro Ala Ser Pro Pro Glu Arg Lys Thr Leu Pro Gin He Cys 
225 230 235 240 



Leu Ser Asn Pro His Ser Arg Phe Asn Leu Trp Gin Asp Leu Pro 
55 245 250 255 

Glu He Arg Ser Ser Gly Val Leu Glu He Leu Gin Pro Glu Glu He 
260 265 270 

60 Lys Leu Gin Glu Ala Met Phe Glu Leu Val Thr Ser Glu Ala Ser Tyr 

275 280 285 



Tyr Lys Ser Leu Asn Leu Leu Val Ser His Phe Met Glu Asn Glu Arg 
290 295 300 

He Arg Lys He Leu His Pro Ser Glu Ala His He Leu Phe Ser Asn 
305 310 315 320 



Val Leu Asp Val Leu Ala Val Ser Glu Arg Phe Leu Leu Glu Leu Glu 
70 325 330 335 
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His Arg Met Glu Glu Asn lie Val lie Ser Asp Val Cys Asp lie Val 
340 345 

Tyr Arg Tyr Ala Ala Asp His Phe Ser Val Tyr lie Thr Tyr Val Ser 
355 360 365 

Asn Gin Thr Tyr Gin Glu Arg Thr Tyr Lys Gin Leu Leu Gin Glu Lys 
370 375 380 

Ala Ala Phe Arg Glu Leu He Ala Gin Leu Glu Leu Asp Pro Lvs Cvs 
385 390 395 " 400 

Arg Gly Leu Pro Phe Ser Ser Phe Leu He Leu Pro Phe Gin Arg He 
405 410 

Thr Arg Leu Lys Leu Leu Val Gin Asn He Leu Lys Arg Val Glu Glu 
420 425 430 

Arg Ser Glu Arg Glu Cys Thr Ala Leu Asp Ala His Lys Glu Leu Glu 
435 440 445 

Met Val Val Lys Ala Cys Asn Glu Gly Val Arg Lys Met Ser Ara Thr 
450 455 460 

Glu Gin Met He Ser He Gin Lys Lys Met Glu Phe Lys He Lvs Ser 
470 475 - 

Val Pro He He Ser His Ser Arg Trp Leu Leu Lys Gin Gly Glu Leu 
485 490 495 

Gin Gin Met Ser Gly Pro Lys Thr Ser Arg Thr Leu Arg Thr Lvs Lys 
500 505 510 * 

Leu Phe His Glu He Tyr Leu Phe Leu Phe Asn Asp Leu Leu Val He 
515 520 525 

cys Arg Gin He Pro Gly Asp Lys Tyr Gin Val Phe Asp Ser Ala Pro 
530 535 540 

Arg Gly Leu Leu Arg Val Glu Glu Leu Glu Asp Gin Gly Gin Thr Leu 
545 550 555 550 

Ala Asn Val Phe He Leu Arg Leu Leu Glu Asn Ala Asp Asp Arg Glu 
565 570 575 

Ala Thr Tyr Met Leu Lys Ala Ser Ser Gin Ser Glu Met Lys A'-g Trp 
580 585 590 

Met Thr ser Leu Ala Pro Asn Arg Arg Thr Lys Phe Val Ser Phe Thr 
595 600 605 

Ser Arg Leu Leu Asp Cys Pro Gin Val Gin Cys Val His Pro Tyr Val 
610 615 620 

Ala Gin Gin Pro Asp Glu Leu Thr Leu Glu Leu Ala Asp He Leu Asn 
625 630 635 640 

He. Leu Asp Lys Thr Asp Asp Gly Trp He Phe Gly Glu Arg Leu His 
645 650 655 

Asp Gin Glu Arg Gly Trp Phe Pro Ser Ser Met Thr Glu Glu He Leu 
660 665 670 

Asn Pro Lys He Arg Ser Gin Asn Leu Lys Glu Cys Phe Arg Val His 
675 680 685 

Lys Met Asp Asp Pro Gin Arg Ser Gin Asn Lys Asp Arg Arg Lys Leu 
690 695 « a ^ 



700 



Gly Ser Arg Asn Arg Gin * 
705 710 
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What is claimed : 

1. An isolated Rac-GEF polypeptide or a biologically-active fragment 
thereof. 

2. An isolated Rac-GEF, or a biologically-active fragment thereof, of claim 
5 1, wherein said polypeptide has a guarune nucleotide exchange activity, 

a specific binding affinity for a guanine nucleotide depleted Rac, or a 
cellular oncogenic transforming activity. 

3. An isolated Rac-GEF or a biologically-active fragment thereof of claim 
1 which is of human. 

10 4. An isolated Rac-GEF of claim 1 comprising amino acid 1 to amino acid 

711, as set forth in Fig. 1 (SEQ. ID NO: 2). 

5. An isolated biologically-active fragment of Rac-GEF of claim 4 which 
comprises amino acids 273-605. 

6. An isolated Rac-GEF, or a biologically-active fragment thereof, of claim 
15 1, which is substantially purified. 

7. An isolated nucleic acid comprising a nucleotide sequence coding for a 
Rac-GEF polypeptide. 

8. An isolated nucleic acid of claim 7, wherein said coded for polypeptide 
has a guanine nucleotide exchange activity, a specific binding affinity 

20 for a guanine nucleotide depleted Rac, or a cellular oncogenic 

transforming activity. 

9. An isolated nucleic acid of claim 7 which is human. 

10. An isolated nucleic acid of claim 7, wherein the nucleic acid sequence 
codes for amino acid 1 to amino acid 711, as set forth in Fig. 1 (SEQ. ID 

25 NO: 2). 

11. An isolated nucleic acid of claim 7, wherein the nucleotide sequence is 
operably linked to an expression control sequence. 

12. An isolated nucleic acid of claim 7, wherein the nucleic acid comprises a 
naturally-occurring nucleotide sequence. 

30 13. An isolated nucleic acid of claim 7, wherein the nucleic acid codes for 

said polypeptide without interruption. 
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An isolated nucleic acid of claim 7, wherein the nucleic acid is DNA or 
RNA. 

An isolated nucleic acid of claim 7, wherein the nucleic acid further 
comprises a detectable label. 

An isolated nucleic acid of claim 7, except where one or more amino 
acid positions are substituted or deleted, or both, and the polypeptide 
coded for by the nucleic acid is biologically-active. 
An isolated nucleic acid of claim 16, wherein the biological activity is a 
guanine nucleotide exchange activity, a specific binding affinity for a 
guanine nucleotide depleted G-protein, or a cellular oncogenic 
transforming activity. 

An isolated nucleic acid of claim 16, wherein the one or more 
substituted amino acid positions are substituted by conservative amino 
acids. 

An isolated nucleic acid of claim 16, wherein the one or more 
substituted amino acid positions is in the Dbl homology domain or the 
pleckstrin homology domain. 

An isolated nucleic acid comprising a nucleotide sequence which 
hybridizes, or whose nucleic acid complement hybridizes, under 
stringent conditions to base pairs of nucleotide sequence 900-1482 as set 
forth in Fig. 1 (SEQ. ID NO: 1). 

An isolated nucleic acid claim 20 comprising at least 95% nucleotide 
sequence identity to the nucleotide sequence set forth in claim 20. 
An isolated nucleic acid of claim 20, wherein said nucleic acid codes for 
a polypeptide having a guanine nucleotide exchange activity, a specific 
binding affinity for a guanine nucleotide depleted Rac, or a cellular 
oncogenic transforming activity. 

An isolated nucleic acid comprising a nucleotide sequence which is 
unique to Rac-GEF. 
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24. An isolated nucleic acid comprising a nucleotide sequence which 
hybridizes, or whose nucleic acid complement hybridizes, under 
stringent conditions to the unique nucleotide sequence of claim 23. 

25. An isolated nucleic acid of claim 24 which codes for a polypeptide 
5 having a guanine nucleotide exchange activity, a specific binding 

affinity for a guanine nucleotide depleted Rac, or a cellular oncogenic 
transforming activity. 

26. A method of expressing in transformed host cells, a Rac-GEF 
polypeptide coded for by a nucleic acid, comprising culturing 

10 transformed host cells containing a nucleic acid according to claim 7 

under conditions effective to express the polypeptide. 

27. A method of expressing, in transformed host cells, a polypeptide coded 
for by a nucleic acid, comprising culturing transformed host cells 
containing a nucleic acid according to claim 20 under conditions 

15 effective to express the polypeptide. 

28. A method of claim 26, further comprising isolating the polypeptide. 

29. A method of claim 26, further comprising modulating expression of the 
polypeptide. 

30. An isolated polypeptide produced by a method of claim 26. 
20 31. An isolated polypeptide produced by a method of claim 27. 

32. A transformed host cell containing a nucleic acid of claim 7. 

33. A transformed host cell containing a nucleic acid of claim 20. 

34. A vector comprising a nucleic acid of claim 7. 

35. A vector comprising a nucleic acid of claim 20. 

25 36. A method of modulating an activity of a Rac polypeptide comprising, 

administering an effective amount of a Rac-GEF polypeptide or a 
biologically-active fragment thereof, or an effective amount of a 
compound which modulates the activity of the Rac-GEF. 
37. A method of claim 36, wherein the Rac-GEF, or biologically-active 

30 fragment thereof, comprises an amino acid sequence which has a 



47 



PCT/US98/12391 



Specific binding activity for a guanine nucleotide depleted state of said 
Rac. 

A method of modulating an activity of a Rac polypeptide comprising; 

introducing a nucleic acid of claim 21 into said cell under 
conditions whereby said nucleic acid is expressed in an effective 
amount to modulate said activity of Rac in said cell. 
A method of claim 38 wherein said nucleic acid oncogenically 
transforms said cell. 

A method of isolating a molecule that binds to a guanine nucleotide- 
depleted state of a Rac polypeptide comprising; 

contacting a Rac polypeptide with a medium comprising said 
molecule under conditions effective for said molecule to bind to said 
Rac polypeptide; and 

separating said Rac polypeptide to which said molecule has 
bound from said medium. 

A method of claim 40, wherein said molecule is Rac-GEF. 

A method of claim 40, wherein said molecule has a molecular weight of 

about 82.5 kilodaltons. 

A method of claim 40, further comprising separating said molecule 
from said Rac polypeptide. 

A method of modulating an activity of a GTPase comprising, 
administering an effective amount of a guanine nucleotide exchange 
factor or a biologically-active fragment thereof, or an effective amount 
of a compound which modulates the activity of the guanine nucleotide 
exchange factor. 

A method of claim 44, wherein the guanine nucleotide exchange factor, 
or biologically-active fragment thereof, comprises an amino acid 
sequence which has a specific binding activity for a guanine nucleotide 
depleted state of said GTPase. 
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46. A method of testing for an agent which modulates the guanine 
nucleotide exchange activity of a guanine nucleotide exchange factor 
comprising: 

contacting a mixture of (a) a polypeptide comprising a guanine 
5 nucleotide exchange factor, or a biologically-active fragment thereof, 

and (b) a polypeptide comprising a GTPase, or a biologically-active 
fragment thereof, to which the exchange factor can bind, with an agent; 
and 

assaying for the presence or amount of guanine nucleotide 
10 exchange activity in the presence or absence of a GEF enhancer. 

47. A method of claim 46, wherein the GTPase is Rac. 

48. A method of claim 47, wherein the guanine nucleotide exchange factor 
is Rac GEF and paid GEF enhancer is ascorbyl stearate. 

49. An agent identified by the method of claim 48. 

15 50. A method of testing for an agent which modulates the binding between 

a guanine nucleotide exchange factor and a GTPase comprising: 

contacting a mixture of (a) a polypeptide comprising a 
guanine nucleotide exchange factor, or a biologically-active fragment 
thereof, and (b) a polypeptide comprising a GTPase, or a biologically- 

20 active fragment thereof, to which the exchange factor can bind, with an 

agent; and 

detecting the presence or amount of binding between the 
guanine nucleotide exchange factor polypeptide, or the biologically- 
active fragment thereof, and the GTPase. 
25 51. A method of claim 50, wherein the GTPase is Rac. 

52. A method of claim 50, wherein the guanine nucleotide exchange factor 
is Rac-GEF. 

53. An isolated agent identified by the method of claim 50. 

54. An isolated antibody which is specific for a Rac-GEF or a peptide 
30 comprising a sequence present therein. 
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An isolated antibody of claim 54, which binds to an amino acid 
sequence selected from the group consisting of 

H,NAFRELIAOLELDPKCOOH 

H,NYQERTYKLPFSSFLCOOH 

H,NPQRSQNKDRRKLGSRNRQCOOH 
A method of increasing the guanine nucleotide exchange activity of a 
guanine nucleotide exchange factor, or a biologically-active fragment 
thereof, said factor capable of acting on a member of the Ras 
superfamily of GTPases, comprising the steps of: 

contacting said guarune nucleotide exchange factor, or a 
biologically-active fragment thereof, with said member of the Ras 
superfamily of GTPases, or a biologically-active fragment thereof; and 

assaying for guanine nucleotide exchange activity under 
appropriate conditions in the presence of a guanine nucleotide 
exchange factor enhancer. 

A method as described in claim 56 wherein said member of the Ras 
superfamily of GTPases is Rac and said guanine nucleotide exchange 
factor is Tiam-1. 

A method as described in claim 57 wherein said GEF enhancer is 
selected from the group consisting of ascorbyl stearate, ascorbyl 
palmitate, and phosphoinositol. 

A method as described in claim 58 wherein said phosphoinositol is 
selected from the group consisting of PI3,4,5P3, PI4,5P2 and PI4P. 
A method of assaying for a compound to treat disease resulting from 
increased guanine nucleotide exchange activity of a guanine nucleotide 
exchange factor, or a biologically-active fragment thereof, said factor 
acting on a member of the Ras superfamily of GTPases, comprising the 
steps of: 

contacting said guanine nucleotide exchange factor, or a 
biologically-active fragment thereof, with said member of the Ras 
superfamily of GTPases, or a biologically-active fragment thereof; 

50 
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assaying for guanine nucleotide exchange activity under 
appropriate conditions in the presence of a guanine nucleotide 
exchange factor enhancer, and in the presence and absence of said 
compound; and 

determining if said compound decreases said guanine nucleotide 
exchange activity. 

A method as described in claim 60 wherein said member of the Ras 
superfamily of GTPases is Rac. 

A method as described in claim 61 wherein said guanine nucleotide 
exchange factor is Tiam-1. 

A method as described in claim 62 wherein said GEF enhancer is 
selected from the group consisting of ascorbyl stearate, ascorbyl 
palmitate, and phosphoinositol. 

A method as described in claim 63 wherein said phosphoinositol is 
selected from the group consisting of PI3,4,5P3, ?14,5?^ and PI4P. 
Compounds of claim 60 that decrease said guanine nucleotide exchange 
activity. 

Ligands that bind to the Src homology 3 domain on Rac-GEF. 
Ligands that bind to the Src homology 3 domain on Rac-GEF identified 
by the methods of claims 46 or 50. 
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Liver Rac GEF Map (1 > 3171) Site and Sequence 
Enzymes : All 440 enzymes (No Filter) 
Settings: Linear. Certain Sites Only, Standard Genetic Code 



--ACTCAAACCAGTGAAGCT-3GGAAAGTCATTGACCTCCA3TCG-7CTGC-GAGAAACATCTGGCTC7ATTTCC 

- " ■ ■ - - I . . . . r ... 1 I ' ■ 75 

;agac:agggaatctgaa3atttggaaaagacc:ggaggaaatcagcaagtga7Caatgsaacac:gataa7 

. 1 1 I ■ r ■ ■ I — ■ ' ' ' ' ' ' 1 50 



'a.a. M27< 



' Liver Specific Sequence ■ 



^et Giu Thr Arg Giu Ser G!l Asd Leu Giu Lys Thr Arg Arg Lys 5er Aia Ser AsD Gin Tro Asn Thr Aso Asn 

"aaccagccaagg:3aaacc'3agttactcc:agaaaaagaggagacttctcaagctgaccaggatatc:aagac 

■ ■ < 1 ' ' ' ' ' — ' 225 



"a.a. 1-127' 



■Liver Specific Sequence- 



- - ^ o >^ o v- 



o A:a Lys Vc Lys P'-c G.u leu Leu ^-o Gii. Lys 3:u ~".r Ser 3."* A!a Asp G!n Aso ile 3!n Asd 
ATT3::ACA':r:AA'TiAGAGAAA*':.:A*:'":iATCGC*::ATAAGACGCAAAAGCAAAGC ' 



^ 300 



'a.a. M27I 



— Liver Specific Sequence — — 

.5 G.u r-c ^^;s Cys ^;s ile -^o *le Lvs .Arg Asr Ser iie rr^e Asn Arg Ser ile Arg Arg Lys Ser Lys Aio 
r:3G::A3AGACAA:::CGAA:3GAACGCCAG::G:::3GCAGA"'-:ACAGGACAATGGAAAATCTGTAAA7GAG 



375 



'a.a. 1-127' 



•Liver Specific Sequence- 



r's Aia Arg AsD As-^ =^ro Giu Arg Asn AiO Ser Cys Leu AiO Aso Ser Gin Aso Asn Gly Lys Ser Vol Asn Giu 
-"-TGACCTTGAATATCCCC'GGAGCAGAATGCC'CCT-GCAGAACAGCAATGCAGACAGACCCAGGAGCCCAG 

! .1 - I I I . . — . — _j —I — —I — ■ ' — ' ' ' 450 



ia.a. M27' 



■Liver Specific Sequence- 



=-0 Leu r-r Leu Asn ile Pro Trp Ser Arg Met Pro Pro Cys Arg Thr Ala Vet Gin Thr Aso Pro Gly Ala Gin 
3AAATGAGTGAGTCGTCCTCCACCCCGGGAAATGGGGC:AC3CCCGAGGAG-3GCCGGCCCTGGCCGACAGCCCC 
. . . , ' I , I I 1 — ' ' ' ■ — ^— *- OdO 

" u Met Ser Giu Ser Ser Ser Tr.r Pro Gly Asn Giy Ala Tr- Pro Giu Giu Tro Pro Ala Leu Ala Aso Ser Pro 
rcCACGC'CACCGAGGCCCTGCGGATGATCCACCCCATTCCCGCCGACTCC'GGAGAAACCTCATTGAACAAATA 

Thr Leu Thr Giu Ala leu Arg iVet lie His Pro He Pro Ala Asp Ser "ro Arg Asn Leu He Giu Gin He 
GGGCT'^CTGTATCAGGAATACCGAGATAAATCGACTCTCCAAGAAATCGAAACCAGGAGGCAACAGGATGCAGAA 

3.V Leu Leu Tyr Gin Giu Tyr Arg Asp Lys Ser Thr Leu Gin Giu He Giu "nr Arg Arg Gin Gin Aso Ala Giu 
r-AGAAGACAATACCAATGGGTCCCCGGCCAGTGAGGACACCCCGGAGGAGGAAGAAGAAGAGGAGGAGGAGGAG 



• a.a. 216-226' 



" Poly-Glutamale Regior^- 



' e Giu AsD Asn Thr Asn Gly Ser Pro Ala Ser Giu Asp Thr Pro Giu Giu Giu Giu Giu Giu Giu Giu Giu Giu 
GAGCCGGCCAGCCCACCAGAGAGGAAGACTCTGCCCCAGATCTGCCTGCTCAGTAACCCCCACTCAAGGTTCAAC 

•a.]Ha.a 227-231 HI 
•Po^- SHS -Rinding Site 
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r-^T':r:ranGATCTTCCCGAGA,CLooAGLAouoooo.Gv-. ^^um ; _^ ^ 



900 



f 




-•.-AArr.A:.:GGATAAGGAAG--:: oCA.-... .....oG--.:.^. -: ^ , _ u 



1050 



— a. a. 273-456 
• Dbt HomcHogv Domaift 



is- 3-- Ara :ie --.s : 5 -5 .V. 



- 5 ::e Le.; ?-e -^s" ^--'"^ '^I'f't 

;;igaaca':3':a~:t:'Ga:g:g~gtga: 



•■ 125 



• a a. 173-456 • 
■ Dbl HorY^>o9y Dorriain- 



I - . - , - . - s A'Q Giv. Glu Asn :ie Voi i!e Ser Aso Vol >.ys Asp 
.a vo Ser G.. A-"? ^^'^ ^t^lr^l^.^^^^r^- : A-,%::,---,cGTCAGCAATC. GACCTAC^ 
•-r:--;TA:CGTTA-GCGG-^o-L^->vT.. - . -.avft.-^.w ^ 



1200 



iaa. 273-456' 
• Dbl Hoinolofly Domain- 




1275 



■ a.a. 273-'i56' 
■ Dbi Hornoiogv Doma:n- 



- ,,s Le. ^ Gl" L« ..o .» P^, V, L.. ;i. >loa.oUuGI.Uu».pPnoLvsCvs 



Dbl insert Region- 



,,.,,,,,,,,cCTCATCC- GCCTTTCCAGAGGATCACACG CCTCAAGCTGTTGGTCCAGAAC ^^^^ 



1 3. a. 273-^56' 
- Obi Hcrr.oiogy Domain 



* 1 




1425 



■ ?i.a. 273-156' 
■ Dbi Hcrr.oicgy Domain ■ 
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a.a. 273-455 I 
• Dbi Homology Domain ^ 

Voi Lys A:a Cys Asn G!u Gly Vol Arg Lys Met Ser Arg Thr Giu Gin Met ile Ser ile Gir Lys Lys Met Glu 
TTCAAGATCAAGTCGGTGCCCATCATCTGCCACTCCCGCTGGCTGCTGAAGCAGGGTGAGC'GCAGCAGATGTCA 



Phe Lys He Lys Ser vol Fro Ue Ue Ser Hts Ser Arg Trp Leu Leu Lys Gin Gly Glu Leu Gin Gin Met Ser 
ri-C^CAAGACCTCCCGGACCC'GAGGACCAAGAAGGTCTTCCACGAAATTTACCTCTTCCTGTTCAACGACCTG 



!575 



1650 



Gly Pro Lys Thr Ser Arg Thr Leu Arg 'r.r Lvs Lys Leu Phe H<s He Tyr Leu Phe Leu Fhe Asn Asd Leu 
r---A-:-GCCGG:AGAT'::^GGAGA:^^GTAC:AGGTAT-GA:::AGC-::3CGGGGArGCTGCG:GT ^^^^ 



Le - '3 . e - V s N. 



i -3 A-g G V -ej Lej Arg voi 



:300 



G;o G:u Le-j G!u AsD G^n Giy 

gaggc:ac:tacatg:taaagg:g 



G Thr Leu A-o A-n Vol Pr.e ile Leu Arg Leu Leu Glu Asr. A:a Asp Asd Arg 



tcagagtgagatgaagcgttggatgagctcactgggccc 



CAACAGG 



1875 



G u Ala Trr Tyr Met Leu Lys Ala Ser Ser G:r. Ser Glu Met Lys Arg Trp Met Thr Ser Leu Ata Pro Asn Arg 
:t^::.TTTnTTTCGTTCACATCCCGGCTGCTGGACTGCCCCCAGGTCCAGTG:GTGC^^ ^^^^ 



>a.a. 608-680" 



- Src Homology 3 Domain - 



Am Thr Lvs Phe Vol Ser Phe Thr Ser Arg Leo Leu Asp Cys Pro G:n Vol Gin Cys Vol H:s Pro Tyr Vol Ata 
;^:^;:;anArr!!!;c;GACGCTaGAGCT^GC:GACATCCTCAACATCCTaGACAAGACTGACGACGGGTGGATC 



2025 



"a.a. 608-680' 
• Src Homology 3 Domain- 



nn rin Pro ASD Glu Leu Thr Leu Glu Leu Aia Asd lie Leu Asn lie Leu Aso Lys Thr AsD Asp Gly Trp He 



• a.a. 608-680' 



■ Src Homology 3 Domain- 



DK - Leu H.s Aso Gin Glu Arg Gly Tro Phe Pro Ser Ser Met Thr Giu Glu He Leu Asn Pro Lys 

^tcS^c^aI^ckaI^^atgttkcc^gtccacaa gatggatgaccctcagcgcagccagaacaaggac^.^^ 

■■a.a. 608-680 H 

- Src Homology 3 D . r-, » i ^ ao„ 

•i« Am Gin Asn Leu Lys Glu Cys Phe Arg Vol His Lys Met Asp Asp Pro Gin Arg Ser Gin Asn Lys Asp 
r'lrrAG'GATGcTGGGCAGCc TGAATcLcAATGACCCCCACCCAGGGGGCCAGCGGGAG CAGGGCCTGCATGAGA^ ^^^^ 

__J ^ 1 ' • ^ ^ ' ' ' ' 

FIGURE 1c 
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Brain-Specific Sequence Map (1 > 198) Site and Sequence 
Enzymes • All 440 enzymes (No Filter) 
Settings- Linear. Certain Sites Only. S tandard Genetic Code 



.;T^^rCGCAGCC:GTTAGTCGCCCCCGACCCA GCCCAGGGCCCCGGCG?^^^GACCCGGCCCCAGCACC 



•Brain Specific Sequence- 



^^^MMB^^MMMMI^^BMMMHa.a. 1-66 



I ^ 



Fro Z---- ?-o Vo: Ser Ara ^.''o Arc 5er Pro G-v Pro Arg Arg G:y Pro Arg Pro Gly Pro Ser Th- 



-.rrrc:;CC3CAGA::-A-GA GCT3C:GGC:3:TGC:--:AGCGCCG CCTGC2CCGTGGACCACGACAGTTCC 

L ' ^ — ^ ' ' 

: Brain Specific Sequence ~ ~~I!~ZZZ" 



»a.a. 1-66* 



Arg Pro Ala Aio Asp Pro Met Gi u Leu Leu Alo Alg Ala Pne Ser Ala Alo Cys Ale Vol Asp H.s Asp Ser Ser 

■ ' ' ■ ' ' ' 

ACCTCGGAAAGCGACGCGCGCGACTCGGCGGCGGGACACCTGCCCGGC 
Brain Specific Sequence- 



Thr Ser Glu Ser Asp Ala Arg Asp Ser Ala Ala Gly H.s Leu Pro Gly 
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iPEST 



PH iCC iPDZ ;DH i PH 

ZZHD M — t\W-o 



178kD 



4IIHI}- 



135 kD 



106 kD 



K\\\.\t 



85 kD 



66 kD 



j.il.'40 - jaljyi 



Figure 3 
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Figure 4 
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Fold Aciivaiion 




Figure 5 
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